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TWO COMPONENTS IN BIPOLAR SCALES: 
DIRECTION AND EXTREMENESS ! 


DEAN PEABODY ? 


Columbia University 


Various measures in psychology use 
a set of response categories extending 
in two directions about some neutral 
point. Scores for 
are summed or averaged to give a 


several responses 


composite score. Examples would in- 
clude some of the methods of absolute 
judgment in psychophysics, the Likert- 
type of attitude scale, Osgood’s se- 
mantic differential, rating scales used 
in personality assessment. The ob- 
ject of measurement for the composite 
scores may thus be some attribute of 
physical stimuli, of verbal concepts, 
of other persons, or of oneself. 

components can be distin- 
guished in the responses: (a) the di- 
rection from the neutral point, repre- 
dichotomy—heavy 


Two 


senting the basic 
versus light, agreement versus disa- 
greement with the statement, etc.; (>) 
the degree or extremeness of the re- 
sponse from the neutral point. For 

1 This report revision of a 
doctoral dissertation at Harvard University. 
The collection of data in England was aided 
by a grant from the Social Science Research 
Council; the replication in the United States 
by a grant from the Laboratory of Social 
Relations at Harvard. Support for rewrit- 
ing was given by the Council for Social 
Sciences at Columbia. I am _ particularly 
grateful to G. W. Allport and F. Mosteller 
for general and statistical advice. 

2 Now at the University of Oregon. 


represents a 


example, in a six-point Likert scale the 
subject writes + 1, + 2, +3; or —1, 
— 2, — 3 to indicate slight, moderate, 
and strong agreement or slight, mod- 
erate, and strong disagreement. The 
basic dichotomy of direction is deter- 
mined by whether he agrees or dis- 
agrees (writes + or —); extremeness 
by whether he indicates slight, mod- 
erate, or strong feelings (writes 1, 2, 
or 3). 

A neutral response category may or 
may not be provided—e.g., Likert 
scales traditionally permitted a neu- 
tral response (writing 0), but many 
recent Likert scales do not. 

When scores for several responses 
are summed or averaged to give a 
composite score, the two components 
become: the number (more generally, 
the proportion) of responses scored in 
each direction; the mean extremeness 
of the responses in each direction. 

The present paper derives a method 
for estimating the relative contribution 
of these two components to composite 
scores, and applies the method to data 
from several Likert attitude scales. 

A possible reaction to the preceding 
might be to grant that the two com- 
ponents described can logically be dis- 
tinguished, but question whether this 
is necessary or desirable. The purpose 
of using an extended scale is to in- 
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crease the refinement of measurement 
over that given by dichotomous _re- 
What then is the purpose of 
separating direction and extremeness 
components? The answer is that there 
are both theoretical and practical rea- 
sons for considering the components 
separately : 

1. The theoretical reason is that the 
distinction may be a psychological as 
well as a logical one. Ideally, the di- 
rection component would represent a 
qualitative aspect of the object of meas- 
urement, while the extremeness com- 
ponent would represent quantitative, 
intensity, aspects. In reality, both 
components are likely to represent, at 
least in part, response tendencies of the 
subjects rather than aspects of the 
object of measurement. Cronbach 
(1946; 1950, p. 21) distinguishes the 
response sets possibly involved in the 
two “ac- 
quiescence” compo- 


sponses. 


components as “bias” or 
for the 
nent, and “tendency to go to extremes” 
for the 
seems desirable to examine these dif- 
ferent factors separately. 
ample, Cronbach (1950) argues 
strongly that the extremeness com- 
ponent may largely represent a re- 
sponse set: 

The usual argument for the more finely 
divided scale of judgment on each attitude 
item is that it is more reliable and that 
subjects prefer it. If the latter advantage 
is significant, the finer scale may be retained 
and scored dichotomously. The argument 
that the finer scale gives more reliability is 
not a sound one, since this is precisely what 
we would expect if all of the added reliable 
variance were response-set 
had no relation to beliefs about the attitude- 
object in question (p. 22) 


direction 


extremeness component. It 


For ex- 


variance and 


2. The practical reason for examin- 
ing the components separately 
concerns the justification for dichotom- 
ous scoring. Regardless of the the- 
oretical status of the two components, 
if the extremeness component is little 


two 


represented in whatever the composite 
measures, 
scoring method based on the direction 
dichotomy would seem justified. 


score use of the simpler 


DEVELOPMENT OF AN APPROXIMATION 
FOR THE VARIANCE OF COMPOSITE 
ScorRES IN TERMS OF DIRECTION 

AND EXTREMENESS SCORES 


The development given below as- 
sumes that a neutral response category 
is not provided for. Provision for a 
neutral category would give a closely 
analogous development. 

The composite score used here rep- 
resents the average of scores for the 
separate responses. An _ individual’s 
composite score can be expressed 
mathematically in terms of direction 
and extremeness scores: 

C=PX-—-(1-P) Y [1] 
where: 

C is the composite score. 

P is the proportion of responses in the 
direction scored positively.* 

X is the mean extremeness of responses 
in the direction scored positively. 

Y is the mean extremeness of responses 
in the direction scored negatively. 


For example, a subject makes 10 


responses as follows: +2, +2, + 2, 
+2, +3, +1, -—1, -1, -—2, —3. 
If the responses are scored as they 
stand, then P = .60, X = 2.00, Y 
1.75, and C = + .50. 

Frequently, to avoid negative num- 
bers some constant is added to all 
scores. This simply raises the com- 
posite score and does not affect the 
development given below. 

The present problem now becomes : 
to what extent does variation in the 
composite scores (C) depend on di- 
rection (P), and to what extent on 
extremeness (X and Y)? 

3If a neutral response category were 
allowed, then Equation 1 should be modified 
to replace 1— P by Q=the proportion of 
responses in the direction scored negatively. 
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The variance of a difference in this 
se can be written as: 


- Upy + are | 


>») 


T 
—2 cov [PX, (1-P)Y] 


[2] 


The terms on the right involve the 
variances and covariance of products. 
There seems to be no simple method 
of handling this situation. The method 
used here is to approximate the terms 
on the right side of Equation 2 by the 
delta method.* In the present case, 
the assumption is that the deviations 
of individual scores around the group 
means are relatively small compared 
to the means, so that higher powers 
of the deviations can be neglected. 

Let Mp, My, and My stand for 
the group P, X, and Y. 
These as constants re- 
individual 


means of 
act 


variations 


means as 


gards between 
scores, 

Let Ap, Ax, Ay stand for individual 
around the group 


deviations means: 


P = Mp+ Ap 
X=My+Ar 
Y = My + dy 


These deviations are assumed to be 
small compared to the means. 

The method will be illustrated for 
the first term on the right of Equation 
2, opx’. 

sy the definition of any variance: 


E[(PX)?] — [E(PX)]* [3] 


opx” 
Applying the delta method: 
PX = MpM, + ApMy 


+ArMp+ApAxr [4] 


In taking expected values, the terms 
involving the first order of A vanish, 
since they are deviations around a 
mean, with an expected value of zero. 
The expected value of ApAr is the 


4I am indebted to F. Mosteller for first 
working out the derivation leading from 


Equation 2 to Equation 10. 
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covariance Av). Hence: 


E( PX) 


(Ap, 
MpM x 4 Top. ¥)7pox [5] 
In squaring Equation 4 the terms 
are collected according to the order 
of A involved: 
(Pay = 
0 order of A: 
1 order of A: 


M,p?M x? 
+ 2ApM pM x’ 
+ 2ArMp*My 
+ Ap?M x? + Ax?M p’ 
+ 4ApAxrM pM x 


These terms will 


order of A: 


order of A: 
4 order of A: {be neglected. 


In taking expected values, the first 
order terms again vanish. The third 
and fourth order terms are assumed 
to be negligible. The result is: 

E| ( PX )?] — M,’M x’ + op’M x? 


+o Y 2M p? + 4r, P.X opayM pM xy [6] 


Substituting from Equations 5 and 
6 in Equation 3 gives: 


= op’M x? + ox’M p’ 
+ 2r P.X opaxM pM x 


opx 


[7] 


Equation 7 gives an approximation 
for the first term on the right of Equa- 
tion 2 in terms of the means, variances, 
and covariance of P and X. 

For the second term on the right of 
Equation 2, an analogous application 
of the delta method gives the approxi- 
mation: 


o-pyy = op*My? + oy?(1 — Mp,? 


—_ 2r, P.y)opoy ( | —_ Mp ) My 


— 7 Px \op’ox” 


[8] 


= yr, P.Y \op'ay” 


For the third term on the right of 
Equation 2, the approximation is: 


cov [PX, (1— P)Y] = — op*MxMy 
+ fr, X.Y oxayM p( 1 — Mp ) 
— r.p.xopaxM pMy 
+ Yripy opoyM x (1 mee Mp) 
+ P.X)T.P.Y \op'o yoy 


[9] 
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Substituting in Equation 2 from the 
approximations 7, 8, and 9 and col- 
lecting terms gives as the overall ap- 
proximation : 


I Il, 
op" ( Vy + My > + aox’*M p* 
Il, 
T oy*(1 — M p)? 
II. 
2r Y y oxoyM pl 1 - Mp) 
III, 
+ 2r PX opaxM p( My + My ) 
Ill, 
2 2r P.y)opoy ( 1 — Mp) (My + My ) 
III, 


- inaienk> 


r.pyopay|* [10] 

Roman numerals are written above 
the terms to indicate the grouping 
that is relevant for present purposes : 
Term I involves variation of P ; Terms 
II,, II,, and II, involve variation of 
X and Y; Terms III,, IIIy, and III, 
involve the covariation of P with X 
and Y. 

(The final Term III, involves 
higher order of the variables than the 
other terms, and will often be neg- 
ligible. The correlations, and the 
standard deviation of proportion scores 
are necessarily less than 1. The stand- 
ard deviations of X and Y also tend 
to be less than 1 if the scale has no 
more than three steps in each direction 
—i.e., a scale of seven steps or less. In 
such cases the final Term III. tends to 
be negligible. For scales that extend 
further, the term might not be neg- 
ligible. ) 


a 


The approximation given by Equa- 


tion 10 can be used to estimate the 
relative contribution of direction and 
extremeness scores to the composite 
scores on different types of bipolar 
Comparison can be made of 
three groups terms, and in 


scales. 


the of 


PEABODY 


particular, the relative size of Term 
I to the combined Term II. 

to Data S1x- 
ATTITUDE 


FROM 
SCALES 


APPLICATION 
Step LIKERT 


This section applies the approxima- 
tion just developed to data from six- 
step Likert attitude scales. For each 
item, the subject chooses between 
three degrees of agreement by marking 
+1, +2, +3; or three degrees of 
disagreement by marking —1, — 2, 
— 3. Hence X and Y must lie be- 
tween 1 and 3, while the proportion 
scores P lie between O and 1. 

Before taking up actual data, it is 
possible to get a general idea of the 
likely results. If one assumes for ex- 
ample that the means of P, X, and Y 
are near the middle of their possible 
values, then Mp will be around .5, 
and My and My around 2. In this 
case : 


Term I is about 16 ap’. 
Term II, is about .25 oy”. 
Term II, is about .25 oy’. 


Term II, is about — 57, y,oyoy. 


Then with the standard deviations 
still to be evaluated, Term I has an 
advantage of 64 to 1 over Term II, 
or Term II,. If one assumes further 
that the o’s are likely to be roughly 
in the ratio of their possible ranges, 
then ay and oy would be about twice 
as large as op. This cuts down the 
advantage of Term I over Term II, 
or II, to 16 to 1, or an advantage of 
8 to 1 over both combined. Insofar 
as the correlation r,x, y) is positive, 
the combined Term II would be made 
even smaller by Term II,. 

While in principle, data from any 
set of items could be used, two points 
deserve consideration. 

1. If the items were such that sub- 
jects tended to respond in the same 
direction on nearly all items, then 





Two CoMPONENTS IN BIPOLAR SCALES 


TABLE 1 


Basic DATA FOR DIRECTION AND EXTREMENESS SCORES 


Direction: P 
Mp 
op 
Rel. of P 





Extremeness: - 
Mx 
ox 


Rel. of X 


My 
oy 


Rel. of Y 


Correlations 
YP, X) 
rcP.Y) 

r(x, Y) 


X or Y would be based on few 
items would tend to be an un- 
stable measure.°® 

2. To give a representative test, a 
variety of types of scales should be 
used. In particular, the test should 
include both the recent type of scale 
deriving from The Authoritarian Per- 
sonality where all items are worded 
in one direction and agreement is al- 
ways scored positively, and the more 
traditional type of Likert scale where 
items are worded in both directions 
and agreement is sometimes scored 
positively and sometimes negatively. 

Accordingly results are presented 
for four sets of items: 


either 
and 


1. Responses to 28 items from the F 
Scale of “authoritarianism” (Adorno, Fren- 
kel-Brunswik, Levinson, & Sanford, 1950, 


5In the limiting case, if a subject re- 
sponds to all items in one direction, either 


his X or his Y score becomes indeterminate 


Dogmatism 





Scale 


General 
Agreement 


(balanced) 
Conservatism 











pp. 255-257, 260—omitting Items 22 and 
30/28 
2. Responses to the 40 items of Rokeach’s 
Dogmatism Scale, Form E (Rokeach, 1956, 
pp. 7-10) 
In both of 


all items are 
agreement is 


these scales 
worded in one direction and 
always scored positively. Hence 
represent a mixture of attitudes toward 
Authoritarianism or Dogmatism with an 
agreement response tendency (“acquiescent 
response set”). 

3. Responses to 32 items referring to 
conservative versus more leftist positions 
on economic and political issues. Sixteen 
of the statements present a conservative 
position and agreement is scored positively. 
The remaining 16 statements present the 
same issue in the reverse direction (the two 
versions are given at different times) and 
agreement is scored negatively. Hence the 
combined score should cancel major effects 
of acquiescent response set and give a fairly 
pure measure of attitudes toward Conserva- 
tism. 

4. Responses to 208 items, scored for 
“General Agreement.” The items include 
originals and reversals of the F and Dog 


scores 
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TABLE 2 


CONTRIBUTION TO VARIANCI 


OF COMPOSITE SCORES BY DIRECTION 


AND EXTREMENESS SCORES 


IIIf, 


Il combined 
III combined 


otal: | + 11 + III 
Actual oc? 
% discrepancy 


% of Total: 
I: Variance of P 
Il: Variance of 
X&Y 
Il1: Covariance of P 
with X & } 


matism items, as well as for 20 items from 
the Anti-Semitism scale (Adorno et al. 
1950, pp. 68-70); and both versions of the 
Conservatism items just mentioned. Agree- 
ment is scored positively for all items. 
Hence the combined should cancel 
major effects of attitude content and give a 
measure of generalized “acquiescent response 
set.” 

The subjects were 88 American and 75 
English engineering college students. Re- 
sults for the American sample are headed 
“US”; results for the English sample “UK.” 

Further details of procedure are 
elsewhere (Peabody, 1961). 


score 


given 


Table 1 gives the means, standard 
deviations, reliabilities,¢ and correla- 
tions for the direction and extremeness 


scores P, X, and Y. 


6 All reliabilities are for split halves sepa- 
rated by a temporal interval, usually 2 


weeks 


Dogmatism 


General 
Agreement 


balanced) 
Conservatism 


O11 
024 


000 
008 


007 
008 


071 
066 


"OF 
‘Yo 


Table 2 gives the evaluation of the 
terms of the approximation Equation 
10. (The final Term III, is omitted 
here since of negligible size. ) 

The closeness of the approximation 
is given in Table 2 by the “% discrep- 
ancy” between the sum of the terms 
and the actual variance of the com- 
posite scores, and ranges from around 

% to 8%. This result is about what 
would be expected from this type of 
approximation. 

Although the results in Table 2 
differ in detail from the rough a priori 
estimate made of the relative impor- 
tance of Terms I and II, the final con- 
clusion is about the same. The aver- 


age result for the two samples on each 


set of items gives the following com- 
position of the variance of the com 
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posite scores: 


About 70%-80% involves variation of 


direction scores—Term I 

About 10% involves variation of extreme- 
ness scores—Term II 

About 10%-20% involves covariation of 
direction scores with extremeness scores— 


Term III 


This is a ratio of 7-8 to 1 for the 
first two terms. 

Two main factors seem responsible 
for this result : 

1. From the general form of Equa- 
tion 10 variation in one component is 
weighted by the second power of the 
mean(s) from the other component 
(i.e., whether a person agrees or dis- 
agrees with an item will usually have a 
larger effect on his composite score 
than the degree of his agreement or 
disagreement). As in the a priori 
estimate, these weights overwhelm- 
ingly favor the direction variance term 
and more than compensate for the 
smaller size of this variance itself— 
so that Term I is considerably larger 
than any of the Terms II. 

2. The correlation between extreme- 
ness scores in the positive and nega- 
tive directions is always positive. 
Those who use large positive numbers 
(strongly agree) tend to use large 


negative numbers (strongly disagree). 
Hence Term II, cancels much of the 
effect of the extremeness component. 
This factor becomes particularly prom- 
inent for the General Agreement scores 
where the variance of direction scores 
is relatively small (presumably due to 
the balancing out of content), but the 
correlation between X and Y becomes 
very substantial. 

A simpler if less rigorous approach 
is to consider the more practical ques- 
tion: how good a substitute for what- 
ever is measured by the composite 
scores can be given by direction or by 
extremeness scores? Table 3 gives the 
relevant correlations together with the 
reliabilities of the composite scores. 

The correlations given in Table 3 
are representative of the values found 
for a wider variety of scales. The 
correlations of composite scores with 
direction scores tend to be around .90 
and above. The correlations with the 
extremeness scores are more variable 
and substantially lower. 

The results show fairly clearly that 
composite scores on these Likert at- 
titude items reflect primarily the direc- 


tion of responses, and only to a minor 


extent their extremeness. The prac- 


tical conclusion suggested is that there 


TABLE 3 


DATA FOR COMPOSITE SCORES 


Reliabilities and correlations 


Reliability of C scores 


Correlations of C scores: 
with Direction scores 
ric, P) 
with Extremeness scores 
ricx 


ray 


General 
Agreement 


(balanced) 


Dogmatism . - 
_ Conservatism 
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is justification for scoring such items 
dichotomously according to the direc- 
tion of response. The resulting score 
will closely reflect whatever the com- 
posite score would. 

Further applications are necessary to 
determine whether these conclusions 
would extend to scales where a neu- 
tral category is permitted, to scales 
with more than six or seven steps, and 
to scales that do not concern attitudes. 
As regards more extended scales (e.g., 
of 9 or 11 steps) the reasoning used 
with regard to six-point scales would 
imply that the extremeness component 
might play a more significant role. 
However, in view of the prevalence in 
psychology of scales with no more 
than six or seven steps, the present 
results may be widely applicable. 


DISCUSSION 


Extremeness scores are quite reli- 
able if based on a large enough number 


of responses, and are to a considerable 
extent independent of direction scores. 
But although extremeness scores can 
be stable measures of something, they 
are largely unrepresented in the usual 


composite scores. Hence the hope that 
Likert scale scores for example should 
be “influenced by the degree to which 
subjects favor or oppose attitude state- 
(Newcomb, 1950, p. 171) is 
largely unfulfilled in practice. 
Extremeness scores will tend not to 
have the same correlates as the com- 
posite scores, but could have correlates 
of their own. Only a few studies using 
separate measures of extremeness give 


ments” 


some evidence of what these correlates 
may be (e.g., Osgood, Suci, & 
nenbaum, 1957, pp. 226-236; Soueif, 
1958). In particular, the general con- 
clusion that extremeness is little re- 
flected in whatever the composite 
scores measure applies to the F Scale 
of “authoritarianism.” This 
worth noting in view of the recurrent 


Tan- 


fact is 


attempts to interpret extremeness as 
“intolerance of ambiguity,” implying 
that extremeness ought to relate to 
composite scores on the F Scale— 
most recently Mogar (1960) uses ex- 
tremeness on the semantic differential 
as a criterion for composite F Scale 
scores. 

What is the nature of extremeness 
scores themselves? Two possibilities 
have already been suggested : extreme- 
ness scores may represent actual dif- 
ferences in intensity atti- 
tudes), or they may only represent 
differences in using the response scale 
(i.e., response set). Differences with- 
in an individual in the extremeness 
of responses to different items pre- 
sumably represent primarily  differ- 
ences in intensity, although even here 
the individual may change his usage 
of the response scale over time. 

On the other hand, as regards dif- 
ferences between individuals in aver- 
age extremeness, a different interpre- 
tation is suggested by the correlations 
between extremeness scores found in 
the study reported here. Extremeness 
seems to be a very general individual 
characteristic : correlations between ex- 
tremeness scores in both directions for 
all possible sets of items (including 
originals and reversals of the F, Dog- 
matism, Anti-Semitism, and Conserva- 
tism items) tend all to be positive, 
most of them substantially so. Thus, 
a person who shows relative extreme- 
ness in taking an “authoritarian” posi- 
tion when agreeing with F Scale orig- 
inals or disagreeing with 
tends also to show relative extreme- 
ness in taking an “anti-authoritarian” 
position when disagreeing with F 
Scale originals or agreeing with re- 
versals, etc. It does not seem very 
plausible to assume that individual 
differences in the intensity of attitudes 
show such systematic generality for 
But 


(e.g., of 


reversals, 


responses in opposite directions. 
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the size of the numbers subjects ac- 
tually write does show such generality. 
Hence differences in extremeness be- 
tween persons seem to represent pri- 
marily response sets characteristic of 
the individual—consistent tendencies 
to use extreme or moderate response 
categories. While some evidence sug- 
gests a secondary tendency for ex- 
tremeness scores to represent differ- 
ences in intensity,’ the primary factor 
—the very great generality of extreme- 
ness scores—seems better interpreted 
as response set. 


SUMMARY 


Scores on the bipolar scales in fre- 
quent use in psychology can be ana- 
the pro- 
portion of responses in each direction 


lyzed into two components: 


of the basic dichotomy, and the ex- 
tremeness of responses in each direc- 
An approximation is developed 
expressing variation in the usual com- 
posite scores in terms of variation in 
the two components of direction and 
extremeness. 

The approximation is applied to four 
of items from Likert attitude 
scvles. The results indicate that com- 
posite scores reflect primarily the di- 
rection of responses, and only to a 
minor extent their extremeness. The 
practical implication is that there is 


tion. 


sets 


7 For example, correlations tend to be posi- 
tive between P and X, and negative be- 
tween P and Y (as in Table 1). This indi- 
cates some tendency to use higher or lower 
positions on the response scale as a whole, 
as would be expected if intensity were a 
factor. However, this tendency should lead 
to negative correlations between X and JY, 
and is in every case overcome by the 
opposing tendency for extremeness scores 
to correlate positively. 
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justification for scoring the items 
dichotomously according to the direc- 
tion of response, 

Since extremeness scores are reli- 
able and are largely unreflected in 
the usual composite scores, they may 
have quite different correlates of their 
own. Individual differences in aver- 
age extremeness show wide generality 
across different extremeness scores. 
This generality extends to responses in 
opposing directions—suggesting that 
the differences primarily represent re- 
sponse sets, and only to a secondary 
degree actual differences in intensity. 
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THE LATERAL HYPOTHALAMIC SYNDROME: 


RECOVERY OF FEEDING AND DRINKING AFTER 
LATERAL HYPOTHALAMIC LESIONS! 


PHILIP TEITELBAUM ano ALAN N. EPSTEIN ? 


University of Pennsylvania 


. 


The brain controls feeding behavior. 
If the base of the brain is damaged 
bilaterally in the ventromedial region 
of the hypothalamus, rats and other 
overeat and become tre- 
obese (Anand, Dua, & 
1955; Brobeck, Tepper- 


mammals 
mendously 
Schoenberg, 


man, & Long, 1943; Wheatley, 1944). 
Damage in the lateral hypothalamus 
produces the opposite effect: the ani- 


mals do not eat and will die in the 
presence of food and water (Anand 
& Brobeck, 1951; Anand, Dua, & 
Schoenberg, 1955). Electrical stim- 
ulation through chronically implanted 
electrodes in the medial hypothalamus 
suppresses feeding, and stimulation in 
the lateral hypothalamus: elicits it 
(Briigger, 1943; Larsson, 1954; Wy- 
rwicka & Dobrzecka, 1960). 

These dramatic and consistent phe 
nomena suggest that a dual hypo- 
thalamic mechanism controls food in- 
Cessation of eating (satiety) is 
controlled by a medial 


take. 
said to be 


1 This work was supported by NSF Grant 
G-9792 and by USPHS Grant 2 b-5273. A 
preliminary report was presented at the 
127th meeting of the AAAS in New York 
on December 29, 1960. 

The authors are grateful to Eliot Stellar 
for his generous advice and material sup 
port throughout this work. Both he and 
R. Duncan Luce made valuable criticisms 
of the manuscript. Malcolm M. Cohen and 
Jay J. Singer gave valuable technical as 
sistance. 

2 Formerly at the Institute of Neurolog 
ical Sciences, Medical School, University of 
Now at the Zoological Lab- 
Biology at the uni- 


Pennsylvania 
oratories, Division of 
versity. 


neural system whereas active feeding 
is maintained by a system in the lateral 
hypothalamus (Anand & Brobeck, 
1951). 

There have been two major de- 
velopments since the original descrip- 
tion of the lateral starvation phenom- 
enon by Anand and Brobeck in 1951 
which suggest a reinterpretation of the 
functions of the lateral hypothalamic 
system. First, not only do the ani- 
mals refuse to eat, but they also do not 
drink (Teitelbaum & Stellar, 1954). 
The adipsia was emphasized by 
Montemurro and Stevenson (1957). 
They reported three rats with lateral 
hypothalamic lesions that did not drink 
water but did eat appreciable quanti- 
ties of a high-fat diet, maintaining 
their weight for as long as 55 days 
when tube-fed with 10% glucose solu- 
tion or water. Without hydration, 
the animals ate less and lost weight. 
This was the first demonstration that 
lateral hypothalamic lesions in the rat 
could produce adipsia without aphagia. 
The more recent finding that hyper- 
tonic saline injected directly into the 
lateral hypothalamus can elicit drink- 
ing as well as eating (Epstein, 1960a) 
is consistent with this. 

Second, the refusal to eat and drink 
produced by these lesions is not always 
permanent, and when recovery occurs, 
it follows a regular and orderly course. 
Teitelbaum and Stellar (1954), work- 
ing with large lateral hypothalamic 
lesions that produced both aphagia 
and adipsia, kept their rats alive by 


tube-feeding. Without maintenance 
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these animals would have starved to 
death, but when kept alive they even- 
tually began to eat and drink again. 
The animals first accepted wet and 
palatable foods and began to maintain 
their weight, then drank water, and 
finally, regulated their weight on 
ordinary dry food and water, and 
seemed normal. 

We will show that lateral hypo- 
thalamic lesions produce a complex set 
of symptoms—a syndrome of deficits 
in drinking as well as in feeding be- 
havior from which the animals recover 
at different rates. In 35 rats (34 
females, 1 male), we have produced 
bilateral lateral hypothalamic lesions 
of varying sizes using a standard ster- 
eotaxic technique (Stellar & Krause, 
1954). The coordinates used were: 
A 6.0, RL 2.0, and 8.0 down from the 
dural 3y offering the ani- 
mals a variety of wet and palatable 
foods * in addition to the ordinary dry 
Purina laboratory chow pellets and 


surface. 


water, and by keeping them alive by 


tube-feeding if mecessary, we have 
identified four clear-cut stages in the 
recovery from lateral hypothalamic 
lesions. The stages always follow the 
sequence and are defined by 
critical behavioral events that occur in 
succession during the recovery proc- 
ess. Each of the stages has been 
analyzed experimentally in an attempt 
to identify the physiological deficits 
underlying it. 


same 


A liquid diet was prepared by mixing 
75 milliliters of whole egg; 125 milliliters 
of 50% sucrose solution; 240 milliliters of 
evaporated milk; 0.3 milliliters Poly-Vi-Sol 
(multiple vitamin preparation of Mead- 
Johnson Co., Evansville, Indiana); and 30 
milliliters Kaopectate (Upjohn Co., Kala- 
mazoo, Michigan). One part 10% formalin 
in 100 milliliters of diet was added as a 
preservative. Milk chocolate refers to finely 
divided Hershey’s milk chocolate. A_ wet- 
mash was prepared by mixing Purina Lab- 
oratory Chow Powder in equal parts with 
water 


The animal whose record is graphed 
in Figure 1 had large symmetrical 
lesions which resulted in a long, slow 
recovery, thus making it possible to 
see the events in each stage quite 
clearly. 


STAGES IN RECOVERY 
Stage I: Aphagia and Adipsia 


The animals refuse all food, wet or 
dry, and they do not drink any water. 
Without tube-feeding, they will die. 
The animal of Figure 1 neither ate 
nor drank for 19 days, steadily lost 
weight, and had to be kept alive from 
the 7th day on by gastric intubation 
with a liquid diet. The behavior of 
such a rat is quite characteristic. It 
is generally inactive, but it is alert and 
can be easily roused to movement. It 
walks with a slightly hunched posture, 
carrying its head low to the ground. 
It typically ignores food and_ will 
actively push away food that is forced 
upon it. If such a rat gets wet food 
on its paws or face, it engages in 
vigorous bilateral shaking of the front 
paws ‘which somewhat resembles the 
paw- and face-grooming movements 
of the normal rat except that there is 
no grooming with the mouth. When 
its paws are wet, it wipes them re- 
peatedly on the floor of the cage, and 
if the face and mouth have food or 
water on them, it walks along and rubs 
its chin and the sides of its face on the 
cage floor. It does not groom itself 
effectively, allowing its fur to become 
completely matted and caked with wet 
food. It actively resists having milk 
placed in its mouth by a medicine 
dropper, and it does not swallow the 
milk once it is there, but rather allows 
it to dribble out the side of the mouth. 
Ordinarily a normal rat not 
show such behavior. But it does en- 
gage in the same paw-waving and 
wiping, chin-rubbing, poor grooming, 


at eS 
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Recovery of food intake with permanent adipsia after large lateral hypothalamic 
(The first two dashed vertical lines set off stages of recovery. 
second, 


First, for 19 days, 
for 19 more days, it eats wet and palatable 


foods but still has to be tube-fed; and third, for the rest of its life, it regulates its food 


intake but does not drink water. 
ate dry food. 
eating. 


The rat was weaned to sweet nonnutritive fluid and then 
When offered only water, Shaded Area 1, it did not drink and then stopped 
Gastric hydration, Shaded Area 2, allowed normal feeding. 


Thereafter, the animal 


ingested sweet fluid and survived on dry food.) 


and rejection when very bitter quinine 
(1% weight/volume) put in its 
mouth or on its paws, face, and fur. 
This suggests that mouth contact with 
food and water is highly aversive to a 
rat with lateral lesions during this 


is 


stage. 

As such a rat loses weight each day, 
its posture becomes increasingly more 
hunched, the fur is piloerected, and its 
activity may increase. This is similar 
to the effect of starvation in a normal 
animal and may result from the dif- 
ficulty a rat has in maintaining its body 
temperature without food (Stevenson 
& Rixon, 1957). The feces of a rat 


with lateral hypothalamic lesions are 
often mucoid and may be quite loose 
and tarry. When tarry are 
present, a test for occult blood in the 
feces is usually positive.* If tube feed- 
ing is withheld and the rat dies, after 
7 to 10 days, we have often found mas- 
sive ulcers in the stomach and hemor- 
rhages in both stomach and 
(Andersson & McCann, 1956; French, 
Porter, von Amerongen, & Raney, 
1952). However, this does not seem 
to be the reason they refuse to eat and 
drink, because in some aphagic and 


stools 


cecum 


with Hematest (Ames Co 


tablets 


4 Performed 
Elkhart, Indiana) 


’ 
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adipsic animals there has been no 


macroscopic evidence of gastrointes- 
tinal pathology. 


Stage Il; Anorexia and Adipsia 


The animals eat wet and palatable 
foods, such as a liquid diet, milk 
chocolate, or cookies,® but refuse dry 
pellets and do not drink water. Al- 
though they eat appreciable quantities 
of wet and palatable foods, they do not 
eat enough to maintain their body 
weight, and they die unless tube-feed- 
ing is continued. The animal is there- 
fore anorexic. It eats but it does not 
eat enough to keep itself alive. 

The rat in Figure 1 nibbled at milk 
chocolate and liquid diet beginning on 
the 20th day, but for an additional 19 
days it did not eat enough to maintain 
its weight without tube-feeding. Along 
with these changes in feeding go many 
other changes in the animal's behavior. 
It begins to use its mouth in groom- 
ing itself and its fur gradually is 
cleaned of matted hair and dried food 
Now when fresh food is placed in the 
cage, the rat no longer ignores it com- 
The 
proaches and investigates odorous and 
palatable foods such as milk chocolate 
and the liquid diet. If the food is 
liquid, it may dip its paws in it, or 
even take an occasional lick or bite at 
it, particularly around the rim of the 
If it is a wet mash, the rat 
may paw it and scatter it around on 
the floor or nibble at it a few times. 
It often chews vigorously as it stands 
over the walks around the 
cage, yet little or no food is eaten. 
Palatable and odorous foods are most 
effective in stimulating the animal to 
investigate and eat a little, but the 
eating is not sustained. It is as though 


pletely. animal actively ap 


container. 


food « Ir 


5 Sunshine Chocolate Chip cookies (Sun- 
shine Biscuits, Inc., Long Island City, New 
York F 


the odor, taste, and other qualities of 
the diet arouse the animal's appetite, 
but there is no maintained character to 
the eating. When a hungry normal 
rat is presented with fresh palatable 
food, it gulps it down vigorously, eat- 
ing a great quantity in a few minutes. 
A lateral hypothalamic animal in this 
stage often approaches the food, 
pauses over it, and then eats in a 
rather delicate and gingerly fashion 
even though it is quite starved. The 
vigorous urge to eat seen in the nor- 
mal animal is still lacking in the ani 
mal with lateral hypothalamic lesions. 

Sut hunger gradually returns. This 
may be seen most clearly in an animal 
which exhibits prolonged anorexia and 
is kept alive by tube-feeding. Each 
day its interest in fresh food grows 
more active and its chewing and lick- 
ing more prolonged. In the intervals 
between tube-feedings, its activity be- 
gins to increase so that the animal 
sometimes displays the hyperactivity 
characteristic of a starved but other- 
wise normal rat. 
with this is a pronounced hypothermia 
(as low as 96 degree Fahrenheit in- 
stead of the normal 99 to 100 degree 
Fahrenheit) if the body weight is very 
low. The activity diminishes if the 
animal is warmed by a lamp directly 
over its cage. As the animal nibbles 
more, it needs less tube-feeding to 
maintain its weight. Then, one day, 
it eats a large quantity of food (39th 
day after operation for the animal 
shown in Figure 1), and its weight 
takes a sudden upward spurt. It is 
now no longer anorexic. 


Sometimes associated 


Stage IlI: Adipsia with a Secondary 
Dehydration-A phagia 


The rats now regulate their caloric 
intake of wet and palatable foods, but 
they still refuse to drink water 
(Williams & Teitelbaum, 1959). They 
ingest a liquid diet vigorously, and 
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their coats grow sleek as their weight 
often to 
Palatability is no longer essential for 
food intake. If these rats are trained 
to press a bar that allows a liquid diet 
to flow directly into the stomach 
through a chronically implanted gas- 
tric tube (Epstein, 1960b), they ingest 
normal amounts of food and maintain 
their weight throughout a 6-day test. 
They regulate their caloric intake nor- 
mally even when the diet is diluted 
with water. When required to press 
the bar three times for each injection 
of food, they work sufficiently harder 
to maintain the intake. This regula- 
tion of food intake without stimulation 
of the oropharyngeal receptors has 
been demonstrated in three rats dur- 
ing this stage and is similar in all 
essential respects to that seen in nor- 
mal animals feeding themselves intra- 
gastrically (Epstein & Teitelbaum, in 
The animals press the bar be- 
cause they are hungry, and the taste 
of the food is no longer necessary to 
elicit feeding. 


rises, preoperative levels. 


press ). 


The animals’ feeding behavior seems 
normal, but they do not drink water. 
However, they can be weaned from 
the liquid diet to a sweet nonnutritive 
solution. By mixing the liquid diet 
each day with an increasing propor- 
tion of the sweet fluid, they can be 
weaned first to a 10% weight by vol- 
ume (w/v) sucrose solution, and from 
this to a 0.5 or 0.2% w/v sodium 
saccharin solution. Thus, for the first 
time since the lateral hypothalamus was 
damaged, freely drinking 
nonnutritive fluid while still refusing 
As their weight 
begins to drop, they show the hy- 
peractivity of starvation, then they 
inevitably begin to eat the dry Purina 
diet and regain their weight. If the 
saccharin solution is then removed (as 
indicated in the hatched area labeled 
1 starting on the 49th postoperative 


they are 


dry food and water. 


AnD ALAN N. Epstein 


day in Figure 1), they still refuse to 
drink water, their food intake drops to 
zero in 2 or 3 days, they lose weight, 


and if prolonged sufficiently, will die. 
In 28 rats brought to this stage of 
recovery, all were weaned to sweet 
fluid and all ate dry food to maintain 
their weight in the process. For every 
animal, removal of the sweet fluid led 
to dehydration because it did not ac- 
cept water and, consequently, stopped 
eating. Restore the sweet fluid, how- 
ever, and it is accepted immediately 
(as seen in Figure 1). They eat dry 
food shortly thereafter and then re- 
gain their weight. In this stage, there- 
fore, the animals do not eat because 
they do not drink. 

The counter-experiment was per- 
formed on four rats. The saccharin 
solution was again removed, but 
water, its essential component, was 
supplied by hydrating the animals in- 
tragastrically with 2 to 3 milliliters 
every hour automatically through a 
chronically implanted stomach tube. 
As seen in Figure 1 in the hatched 
area labeled 2 starting on the 64th 
postoperative day, the animals con- 
tinue to eat dry food normally when 
hydrated, even though they no longer 
to drink. Hydration 
either by drinking sweet solutions or 
by intragastric injections allows nor- 
mal feeding. It is clear, then, that in 
this stage of the lateral hypothalamic 
syndrome, the refusal to drink water 
produces dehydration which in turn 
prevents feeding. 

Does dehydration also account for 
the failure to eat in Stage I? Will 
these animals eat immediately after 
operation if they are hydrated? 
Chronic gastric tubes were inserted in 
five animals before they were sub- 
jected to bilateral lateral hypothalamic 
From the time they recovered 
from the anesthetic, usually the day 
after operation, they were hydrated 


have saccharin 


lesions. 
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Food and water intake and body weight of five animals with lateral hypothalamic 
lesions hydrated automatically from the time of operation. 


(Four of the animals died 


after a period of complete aphagia and anorexia despite hydration.) 


with 2 to 3 milliliters of water auto- 
matically every hour. Figure 2 shows 
the results. All animals showed com- 
plete, immediate postoperative apha- 
gia, varying in duration from 2 to 10 
days. Then they all displayed an- 
rather than aphagia, eating 
wet food but not enough to 
maintain their weight. Four of the 
five died. The fifth began 
to eat dry food on the 7th day and 
was able to maintain its weight. When 
hydration stopped 
eating and lost weight; it ate again 
immediately after hydration was re- 
sumed. It had, therefore, recovered 
from its initial aphagia and anorexia 
and passed into Stage III, where it 
ate if, and only if, it was hydrated. 
all of the animals were in- 
itially aphagic and because four of the 
five animals died despite adc -quate hy- 
dration, we conclude that intragastric 
hydration initiated in the immediate 
postoperative period does not assure 
normal feeding (Morrison & Mayer, 


orexia 
some 


animals 


was stopped, it 


Because 


anorexia of 
due just to 


1957). The aphagia and 
Stages I and II are not 
dehydration. 

Why do the rats in Stage III after 
lateral hypothalamic lesions refuse tap 
or distilled water and yet readily ac- 
cept saccharin solutions? Do they 
drink the sweet saccharin solution be- 
cause they are thirsty or do they take 
it in the same way that they take a 
liquid diet—as a food? A normal ani- 
mal increases its intake of saccharin 
solution both when deprived of food 
(Carper & Polliard, 1953) and when 
dehydrated. Four normal rats, de- 
prived of food for 3 days, took little 
or no water but drank increasing 
amounts of saccharin (from two to 
four times as much as they did when 
feeding normally). Three normal rats, 
dehydrated by an intraperitoneal in- 
jection of 2 milliliters of 1 M NaCl 
per 100 grams body weight (Adolph, 
Barker, & Hoy, 1954), showed an 
average intake of 14 milliliters of 0.2% 
saccharin solution in 2 hours, whereas 
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three other normal rats drank 16 milli- 
liters of water when similarly dehy- 
drated. treats a 
saccharin solution as either a food or 
a fluid, depending upon its condition 
of deprivation. When hungry it eats 
a saccharin solution, and when thirsty 
it drinks the saccharin. 


Thus a normal rat 


What does a lateral hypothalamic 
animal do? When deprived of food 
for 3 days, four adipsic lateral hypo- 
thalamic rats increased their daily in- 
take of saccharin solution (from 20 to 
300% more than when feeding freely ). 
However, on repeated occasions, seven 
adipsic rats drank little or no saccharin 
solution in response to intraperitoneal 
There were 20 in- 
stances of complete lack of response 
for as long as 5 hours and five in- 
stances when they drank only 2 or 3 
milliliters. Even a large dose of salt 
that proved lethal failed to elicit in- 
creased fluid intake. lateral 
hypothalamic animals do not increase 
their saccharin intake 
drated but do when deprived of food, 
they are, in effect, eating the sac- 
charin, not drinking it. They are 
hungry, not thirsty. Because they do 
not respond in this stage to levels of 
dehydration — that 
drinking in normal animals, their re- 
fusal to drink water in all three initial 
recovery be due to a 
severe impairment of the central mech- 
anism for water intake regulation. 


hypertonic saline. 


Jecause 


when dehy- 


produce vigorous 


stages of may 


The refusal to drink water may be 
permanent, as illustrated in Figure 1. 
The animal died 200 days after opera- 
tion, never again having drunk water. 
that 
time by “eating” sweet fluid and eat- 
ing dry food. Three other animals 
showed permanent adipsia until they 
died after 267, 230, and 107 days, 
One animal, still living, 


It sustained itself for most of 


respectively. 


has been adipsic for 485 days. 
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Stage IV: Recovery 


Eventually, after passing through 
the stages described above, most rats 
with lateral hypothalamic lesions begin 
to drink water. Because they continue 
to eat food and maintain their weight 
now in the presence of water as the 
only fluid, they superficially appear 
normal and in a sense have recovered. 

However, more sensitive tests re- 
veal that severe deficits in water regu- 
lation still exist. Some animals, for 
examnle, still do not respond at all to 
dehydration by intraperitoneal hyper- 
tonic saline. This deficit has been ob- 
served for months and may very well 
be permanent. Also, four of 
animals when offered water for only 
2 hours each morning did not compen- 
sate by taking their daily requirement 
(approximately 25 milliliters) in that 
time as normal animals do. They took 
little or nothing, became progressively 
dehydrated, stopped eating, and lost 
weight until they were nearly dead. If 
they drank at all, they did so shortly 
after the water was presented. They 
typically 2 milliliters 


these 


drank 1 or 2 
(never more than 5 milliliters), 
to the back of the cage and slept. 
When returned to 24-hour access to 
water, they again managed in an ap- 
parently normal manner. 


moved 


There is a curious paradox here. 
These animals do drink when the 
water is available overnight but do not 
drink in response to rapid dehydra- 
tion with hypertonic salt nor do they 
drink enough when water is available 
for 2 hours in the morning. The 
reason for this apparent contradiction 
was discovered by observing what the 
animals were doing when they drank. 
By attaching a drinkometer (Stellar & 
Hill, 1952) to the water tube and re- 
cording the pattern of water ingestion, 
animals drank 


we found that these 


nothing during the day, but drank in 
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short bursts of rapid licking almost 
continuously throughout the entire 
night. This is when they normally eat. 
In addition, if offered both food and 
water simultaneously at night, they 
began to drink only after they had 
begun to eat. Shortly after eating a 
little dry food, they drank a little 
water, ate more dry food, drank more 
water, and so on throughout the night. 
One of the animals showed this be- 
havior in its most extreme form. It 
would stand near the drinking tube 
holding a pellet in its front paws, 
munch on the pellet for a few minutes, 
lick actively at the water, and then go 
back to chewing on the pellet. Its 
eating was extremely inefficient. It 
would bite large chunks off each 
pellet, chew them a little and then drop 
them while it drank a small quantity 
of water. In the course of the night 
it would waste as much if not more 
food than it actually ate by dropping 
small chunks of it through the mesh 
floor of its 


cage. Its drinking was 


also abnormal. Only small amounts 
of water were drunk during bursts of 
Although 
25 to 30 milliliters of water were con 
sumed, the cumulative record showed 
licking throughout the 
night, interrupted by pauses for short 
The animal licked all 
night to consume an amount of water 
that a normal rat could drink in sev 
eral sustained draughts. It was as if 
the animal drinking not to 
quench its thirst but simply to wet 
its mouth, 
These 


active and precise licking. 


continuous 


bouts of eating. 


were 


observations suggest that 
such an animal drinks only while it 
To demonstrate this, it was de- 
prived of water for 24 hours and of- 
fered access to it for 2 hours during 
the night (8 to 10 p.m.) when it nor 
It drank 22 milliliters of 
water while eating 12 grams of food. 
The amount of water deprivation made 


eats. 


mally ate. 


no difference. It drank only in pro- 
portion to the amount it ate. After 
48 hours of water deprivation it drank 
22 milliliters of water while eating 11 
When the experiment 
was repeated in the absence of food so 
the animal could not eat, it again did 
not drink any water in 2 hours despite 
as much as 48 hours of water depriva- 
tion. As soon as the food was re- 
placed, the animal started eating and 
only after eating for a while did it 
start to drink. The same result was 
obtained in all four animals that had 
previously failed to drink adequately 
in the morning. The’ animals were 
then trained to shift their eating into 
the morning hours (10 a.m. to 12 
noon) by offering them food only at 
that time. Now that they were eating 
in the morning, they drank enough 
water : maintain their 


grams of food. 


in 2 hours to 
weight on this regime for several days. 
Beeause the animals drink only while 
they eat, we call this phenomenon 
“prandial drinking.” 

In the same way that the anorexic 
animals seen in Stage II are eating 
without caloric regulation, so in Stage 
IV, when the animals first begin to 
drink water, they are drinking with- 
out response to bodily dehydration. 
Just as palatability was sufficient to 


arouse appetite for food, so the mouth 
factors involved in eating dry food 
may be sufficient to arouse prandial 
drinking. 

These animals are greatly affected 
by slight changes in the taste of their 


water. Normal rats will drink bitter 
quinine hydrochloride solutions, up to 
a concentration of 1% w/v, if they are 
the only fluids available. When in- 
gested at this concentration, 15 to 20 
milliliters of quinine hydrochloride is 
lethal to a rat. A normal rat, there- 
fore, will drink quinine-treated water 
until it becomes a poison. ~ As this 
point is approached, they drink less 
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and less, gradually stop eating, and 
Lateral hypothalamic rats that 
have been drinking water reject qui- 
nine hydrochloride solutions at a con- 
centration as low as 0.005%. They 
refuse it day after day, stop eating, 
lose weight, and die. But when they 
are hydrated intragastrically with it, 
they thrive. Thus, the refusal to eat 
and drink until death seen in the 
earlier stages of the lateral hypotha- 
lamic syndrome can be reinstated in 
apparently recovered animals merely 
by making their water slightly bitter. 

A normal rat, as it becomes thirstier, 
overcomes the barrier to ingestion im- 
posed by the bitter taste of the quinine 
up to the lethal concentration and 
drinks enough to stay alive. But a rat 
with lateral hypothalamic damage does 
not. Even though it drinks normal 
amounts and maintains itself when 
constantly available, the 


clie. 


water is 


lateral hypothalamic animal fails to 


overcome the quinine taste barrier. In 
addition, it fails to drink adequate 
quantities when access to water is 
limited despite prolonged dehydration, 
and fails to respond to intraperitoneal 
saline challenge. These deficits in mo- 
tivation and in water regulation imply 
that the central neural thirst mechan- 
ism is still severely impaired. The 
powerful urge to drink is still absent. 
These animals drink only while they 
eat, not in response to dehydration. 
There is a striking resemblance here 
to Adolph’s (1957) description of the 
ontogeny of food and water regulation. 
The infant rat develops hunger before 
thirst and, in the neonatal period, does 
not respond to dehydration. It is 
adipsic but not aphagic. 

Although, according to many meas- 
ures, food intake has been normal since 
early in Stage III, more sensitive 
tests again reveal continuing deficits. 
Here, too, taste seems to be an im- 
portant factor. Even though they are 
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apparently eating normally, as_ little 
as 0.05% quinine hydrochloride w/v, 
mixed thoroughly in dry Purina 
powder, is often sufficient to cause 
rats with lateral hypothalamic lesions 
to refuse food completely to the point 
of extreme starvation. Normal rats 
readily accept food with up to 1.25% 
quinine, at which point they begin to 
decrease their intake and lose weight. 
When faced with the bitter taste of 
quinine, rats with lateral hypothalamic 
lesions, therefore, display a greater 
aversion to a negative taste in food. 
Palatability is again a powerful deter- 
minant of food intake in these animals. 
In the anorexic stage (Stage II), only 
highly palatable foods induced them 
to eat. And in the recovery stage, a 
slight negative taste prevents them 
from eating. This resembles the fin- 
icky eating observed in obese hyper- 
phagic rats after more medial hypo- 
thalamic (Teitelbaum, 1955). 
This finickiness implies either an im- 
paired motivation for food (Miller, 
Bailey, & Stevenson, 1950; Teitel- 
baum, 1957) or an increased reactivity 
to its taste; presumably it is present 
in its most extreme form in the early 
lateral hypothalamic 


lesions 


stages of the 
syndrome, 
The recovery process continues even 
after the animal has started drinking 
water again. Some rats become more 
responsive to intraperitoneal injection 
of hypertonic saline and indeed three 
of seven rats tested eventually re- 
sponded as well as a normal animal. 
Similarly in these same three animals 
it required as much as 0.1% quinine 
to prevent them from drinking whereas 
earlier they had been stopped by a 
0.005 or 0.01% quinine solution. A 
gradual recovery can also be seen in 
the gross regulation of water intake, 
food intake, and weight. Immediately 
after entering the recovery stage, and 
for some time thereafter, some rats 
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Fic. 3. 


Stages of recovery seen in the lateral hypothalamic syndrome. 


(The critical 


behavioral events which define the stages are listed on the left.) 


with lateral hypothalamic lesions drink 
water in than normal amounts, 
eat correspondingly less, and so main- 
tain their weights at a subnormal level 
(210 to 220 grams). This hypodipsia 
may last for months, but usually it 
disappears. Such a rat begins to drink 
and eat more, and it regulates its 
weight at 270 to 280 grams, a more 
normal weight for its age. 

The four stages of recovery from 


less 


lateral hypothalamic lesions are sum- 
marized in Figure 3, which lists the 
critical behavioral events whose pres- 
ence or absence defines those stages. 
First, there is a stage of “aphagia and 
adipsia” in which the animal com- 
pletely refuses to eat or drink; second, 
“anorexia and adipsia” in which it 
eats, but not enough to maintain its 
weight, drink; third, 
‘adipsia and dehydration-aphagia,” in 
in which the animal fails to eat only 
because it does not drink; and finally 
the “recovery” stage in which the 
animal drinks water, eats dry food, 
and will survive, but still shows cer- 
tain deficits in eating and drinking 


and does not 


behavior. It must be emphasized that 
stages in recovery will not be 
observed unless the animal is offered 
wet and palatable foods from the time 
of the operation. If only water and 
dry food are available, the rat will die 
unless it rapidly reaches Stage IV, at 
which point it accepts water and can, 
therefore, eat dry food. 


these 


ANATOMICAL ANALYSIS 


The duration of these stages is vari- 
able and seems to depend on both the 
locus of the lesions and the amount of 
The lesions in 13 
animals were studied microscopically 
using thionin-stained celloidin sections 
through the diencephalon. Tissue in 
the lateral hypothalamic areas was 
destroyed bilaterally in all animals. 
Typically, the lesions were large, ex- 
tending medially to the mammillo- 
thalamic tracts and the fornices (the 
fornix _ itself only rarely de- 
stroyed) and laterally into the medial 
portions of the internal capsules and 
optic tracts. Damage was greatest at 


tissue damaged. 


was 
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the level of the ventromedial nuclei. the majority of animals, the most 
The lesions extended anteriorly to the ventromedial portions of the lateral 
level of the supra-optic nuclei and hypothalamic areas were: spared. 


posteriorly into the pre-mammillary Figure 4 shows a frontal section 
regions. Dorsally, the overlying sub- showing lesions in the lateral hypo- 
thalamus was always involved. In_ thalamus at the level of the ventro- 


Fic. 4. Photomicrographs of thionine-stained sections through the tuberal region of 
the hypothalamus showing lesions in five animals with interesting variations of the lateral 
hypothalamic syndrome. (Damaged tissue is outlined in black. A: Large, symmetrical 
lesions producing permanent adipsia. Recovery graphed in Figure 1. B: Transient aphagia 
and anorexia, despite immediate postoperative hydration, followed by recovery of food and 
water intake. Animal 3 in Figure 2. C: Died on sixth postoperative day despite adequate 
hydration. Showed aphagia and anorexia. Animal 5 in Figure 2. D: Asymmetrical lesions 
producing transient adipsia without aphagia. E: Massive lesions involving both medial 
and lateral tissue producing hypothalamic hyperphagia and permanent adipsia. 

Normal animal: Arc, arcuate nucleus; DM, dorsomedial nucleus; Fx, fornix; Hab, 
habenula; IC, internal capsule; Inf. infundibulum; LHA, lateral hypothalamic area; MT, 
mammillothalamic tract; OT, optic tract; Rh, rhinencephalon; VM, ventromedial nucleus: 
VT, ventral thalamus; ZI, zona incerta; III V, third ventricle.) 
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medial nuclei in five animals who 
exhibited interesting variations of the 
lateral hypothalamic syndrome. The 
extent of damage has been outlined 
heavily in black. With the very large 
lateral hypothalamic lesions of Section 
A, each stage in the recovery process 
lasted for weeks or months. This is 
the animal whose recovery is shown 
graphically in Figure 1. This rat 
showed prolonged aphagia and ano- 
rexia (38 days), and it never drank 
water again. Such an animal, because 
of the dehydration-aphagia that is 
present when it refuses to drink water, 
would seem to be permanently aphagic 
It was only by induc- 
ing it to accept a palatable, wet diet 


on a dry diet. 


that the stage of pure adipsia could 
be demonstrated. With smaller or 
asymmetrical lesions, as in Sections 
5% 


very 


, and D, the entire sequence passes 

quickly, and the animal 
begin the recovery not at Stage I, but 
These ani- 


may 


at any of the later stages. 
mals show aphagia and anorexia for 
only a few days or a week if offered 
a palatable liquid diet, and recover 
water intake within 1 or 2 months. 
Section B is from the brain of animal 
2. It refused to eat for 

days despite automatic hydration, 
then nibbled wet food for 4 days, but 
month after 


in Figure 


remained adipsic for 1 


operation. Section C shows the lesions 


of Animal 5 in Figure 2. It 
aphagic and anorexic until it died even 
though hydrated. No attempt 
made to keep this animal alive by tube 
feeding, but since it had already |! 


was 
was 


begun 
to eat appreciable amounts of wet food, 
it probably would have regulated its 
caloric intake within a few more days. 
Section D shows lateral hypothalamic 
lesions which produced only adipsia, 
without any aphagia or anorexia. The 
animal accepted wet mash or liquid 
diet immediately 
weight, but it refused to drink water 


and regulated its 


and showed dehydration-aphagia on a 
dry diet. This effect closely re- 
sembles the earlier findings of adipsia 
without aphagia in the rat by Monte- 
murro and Stevenson (1957) and in 
the dog by Andersson and McCann 
(1956). Finally, Section E, at a 
slightly more rostral level, shows a 
huge lesion that involves the entire 
hypothalamus except for the infundi- 
bulum, destroying both the ventro- 
medial nuclei and the lateral hypo- 
thalamic areas, and extending laterally 
well into the internal capsules. This 
animal was aphagic and anorexic for 
only 4 days, then regulated its intake 
of wet food. It refused to drink water 
until it died 107 days later. When 
offered a wet diet it overate and be- 
obese; but every time it 
offered a dry diet and water, it re- 
fused to drink, stopped eating as a 
result of dehydration, and lost weight 
precipitously. This verifies the earlier 
findings of Williams and Teitelbaum 
(1959), who showed that hypotha- 
lamic hyperphagia can coexist with 
the lateral hypothalamic syndrome. It 
cannot be manifested in Stages I or II 
during which caloric regulation is not 
seen, but in Stage III, when caloric 
regulation appears, so does hypotha- 
lamic hyperphagia. But hydration is 
necessary. With a dry diet and water 
the only fluid available, the animal 
does not drink, and hypothalamic hy- 
perphagia is masked by adipsia. In 
addition, such animals make it clear 
that damage to the ventromedial re- 
gions of the hypothalamus can produce 
hyperphagia even when the lateral hy- 
pothalamic areas are largely destroyed. 

In general, the severity of the def- 
icits was correlated with the size of 
Transient deficits and 
rapid recovery followed small and 
asymmetrical lesions, whereas large 
lesions permitted only slow and_ in- 
complete recovery through stages of 


came was 


the lesions. 
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long duration. This suggested that 
the recovery of function was mediated 
by the undamaged tissue around the 
lesions. Additional lesions in the tis- 
sue immediately adjacent to the origi- 
nal damage were made in six recovered 
animals. In all of them, additional 
lesions found to reinstate the 
After appropriate lesions 
the reoperated animals became aphagic, 
anorexic, and adipsic all over again, 
and the recovery followed the same 
general pattern. The phenomenon was 
particularly clear in one animal. The 
recovery process was repeated four 
Each recovery, in 
this animal, was slower. The fourth 
recovery proceeded only to anorexia 
and was terminated by a fatal gastric 
hemorrhage. Other animals have been 
rendered permanently anorexic by ad- 


were 
syndrome. 


times. successive 


ditional lesions, in one for as long as 
270 days. Therefore, the undamaged 
tissue surrounding the original lesion 
participates in the recovery. In ad- 
dition, this result suggests that the 
experience gained in the initial re- 
from lateral hypothalamic le- 
sions did not protect the animal’ from 
the effects of additional damage. 


COV ery 


Since the lateral hypothalamic syn- 
yme in rats is the result of a com- 
deficits in both 
feeding and drinking, the lateral hy- 
pothalamus must contain tissues con- 
cerned with both functions. The struc- 
involved could be within 
the lateral hypothalamus or fibers of 
from other Morgane 
(196la) reports a syndrome of apha- 


] 
(li 


interaction of 


r¢ 
plex 


tures cells 


passage areas. 
gia and adipsia in the rat with lesions 
in the the globus pallidus, 
completely outside the lateral hypo 
thalamus. He points out that the 
pathways from the 
(ansa and fasciculus 


apex of 


major efferent 
globus pallidus 
lenticularis) are usually interrupted by 
lateral hypothalamic Thus, 
extrahypothalamic partici- 


lesions 


structures 
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pate in the regulation of food and 
water intake 

Whatever the nature of the neuro- 
anatomical systems involved in feeding 
and drinking, some parts of them must 
overlap in the lateral hypothalamus. 
Is it possible to separate them? A 
chemical separation of the systems has 
been demonstrated in the rat by Gross- 
man (1960). He showed that from 
the same site within the perifornical 
region of the hypothalamus, chol- 
inergic drugs elicit drinking and ad- 
renergic drugs produce feeding. This 
implies that the functions of each of 
the systems may be dependent upon 
different neurohumoral agents. 

An anatomical separation may be 
possible because small, asymmetrical 
lesions produce adipsia without apha- 
gia and pure effects have been pro- 
duced by electrical stimulation. Drink- 
ing without feeding has been observed 
in the goat and rat (Andersson & Mc- 
Cann, 1955-56; Greer, 1955), as well 
as feeding without drinking (Larsson, 
1954;: Miller, 1957; Smith, 1956). 
Thus, it is quite possible that two 
systems which do not entirely overlap, 
one concerned with drinking and the 
other with feeding, merge in the lateral 
hypothalamus, and so _ destruction 
there yields the combination of def- 
icits in feeding and drinking behavior 
that makes up the lateral hypothalamic 
syndrome, 

Morgane (196l1b) has suggested a 
further subdivision of the lateral hypo- 
thalamic feeding system. He finds 
that rats with lesions in what he has 
called the “far-lateral’”’ region of the 
lateral hypothalamus (centered just 
medial to the internal capsule) lose 
weight more rapidly while suffering 
aphagia and adipsia and are less likely 
to recover than animals with lesions 
in the “mid-lateral’” hypothalamus 
(centered just lateral to the fornix). 


\ccording to Morgane, recovery is 





LATERAL HyporHALAMIC SYNDROME 87 


the rule following mid-lateral lesions. 
But recovery is the exception, if not 
an impossibility, after far-lateral le- 
sions. He finds it difficult to main- 
tain the far-lateral animal alive on a 
tube-feeding regime that is adequate 
for the mid-lateral animal. He sug- 
gests that the far-lateral region of the 
hypothalamus contains _ pallidofugal 
structures (ansa and fasciculus lentic- 
ularis) that are essential for the basic 
feeding reflexes. Lesions here should 
therefore produce permanent aphagia. 
In addition, the failure of the far- 
lateral animal to survive on tube-feed- 
ings leads Morgane to attibute its 
rapid decline to a form of “metabolic 
failure.” 

In our own experience, as described 
in this paper, many of the animals 
that have shown the typical stages of 
recovery were found to have extensive 
damage in the far-lateral hypothalamus 
(see Figure 4, especially Section A). 
Clearly these animals have very severe 
this we 
agree with Morgane. However, they 
do feed again, although they may re- 
main adipsic. In the early stages of 
their recovery, they can be maintained 
on tube feeding if cared for assidu- 
ously. The more rapid decline in 
weight seen in similar animals and at- 
tributed to metabolic failure by Mor- 
gane can not, therefore, be due to an 
inability to utilize food but may be 
related to changes in activity, tem- 
perature regulation, or metabolic rate. 
Stevenson® has shown that lesions 
within the lateral hypothalamus may 
produce an increase in metabolic rate 
and has suggested that this increased 
energy expenditure may account for 
the rapid postoperative decline of the 
far-lateral animal. In addition, we 
must re-emphasize that lateral hypo- 
thalamic lesions produce deficits in 


and prolonged deficits. In 


6 Stevenson, J. A. F. Personal communi- 


cation, 1961. 


both feeding and drinking. Prolonged 
deficits then may simply represent the 
summated effects of extensive destruc- 
tion of both systems rather than selec- 
tive involvement of “more basic’ and 
functionally distinct portions of the 
feeding system alone. 


SUMMARY AND CONCLUSIONS 


Although we have made interpreta- 
tions throughout this paper as new 
phenomena were described, there are 
several generalizations from this work 
that deserve emphasis. Each of them 
has changed our thinking about the 
lateral hypothalamic control of feed- 
ing and drinking. 

We have confirmed the idea that 
there are tissues in the lateral hypo- 
thalamus that are concerned with both 
feeding and drinking. Aphagia is al- 
ways accompanied by adipsia. We 
have never seen pure aphagia—i.e., 
absence of feeding where drinking is 
unimpaired. Any impairment in feed- 
ing then that follows these lesions can 
only be fully understood as the com- 
bined result of deficits in both regula- 
tions. 

Recovery is the rule following these 
lesions, if the animals are maintained 
long enough. Barring accidents, or 
gastrointestinal pathology, it is pos- 
sible to keep these animals alive in- 
definitely. Although we have seen 
prolonged deficits, we have never seen 
an animal with lateral hypothalamic 
lesions that did not show some re- 
covery. 

Feeding recovers before drinking. 
The quality of the diet is crucial for 
this observation. At first the animal 
will eat only wet and palatable foods. 
sut if it is offered nothing except dry 
pellets and water, it will not eat until 
after it has begun to drink, and this 
relationship will be misunderstood. It 


appears to us, therefore, that lateral 
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hypothalamic lesions impair feeding 
less severely than drinking. 

ating does not necessarily imply 
regulation. When the animals first 
begin to eat they are anorexic. They 
eat only highly palatable foods, and 
do not eat enough to regulate caloric 
intake or body weight. Their feeding 
seems to be evoked by the taste and 
smell of the food. 
to it by appetite, not driven to it by 
hunger. 


They are drawn 


Hunger returns, often abruptly. A 
new control reappears, enabling the 
animal to regulate its caloric intake. 
The central feeding mechanism is now 
responsive to some variable associated 
need If this 
were identified, we would know 
control that is sufficient for 
caloric regulation in the normal rat. 


with caloric variable 
one 


system 


Even after the return of regulatory 
It can mask 
caloric regulation if only dry food and 


feeding, adipsia persists. 


The animal will 
not drink water and aphagia will be 
the result of dehydration. If the ani- 
mal saccharin 
tions, it hydrates itself involuntarily 
then pellets. Thus, 
adipsia can occur without aphagia. 


water are available. 


is induced to eat solu- 


and eats dry 


\dipsia can be permanent. Some 
animals drink 
though out 


never water again, 
they their natural 
In our experience, aphagia is 
always transient. We have never seen 


live 


lives. 


permanent aphagia after lateral hypo- 


thalamic lesions. 
belief that the 


regulating thirst is more completely 


[This confirms our 
central neural tissue 
impaired by these lesions than is the 
tissue controlling feeding. 

Drinking frequently reappears. Ini 
tially the drink 


Just as anorexia is feeding 


animals only while 


they eat. 
starvation so 


without response to 


prandial drinking is drinking without 


response to dehydration. Here again, 
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ingestion does not necessarily imply 
regulation. Like hunger, thirst may 
eventually reappear. The central 
thirst mechanism is now responsive 
to dehydration. 

Although regulation of 
water intake has recovered, motiva- 
tion is still impaired. The recovered 
animal is greatly affected by the taste 
of both its food and water. Addition 
of small amounts of quinine deters it 
from eating and drinking. Wherever 
taste is excessively important, motiva 
tion is impaired. 


food and 


Finally, in the lateral hypothalamic 
syndrome, the initial transition from 
aphagia to anorexia and from adipsia 
to prandial drinking seems to depend 
upon recovery of the influence of pe- 
ripheral sensations (taste and smell 
of food, or dry mouth). The next 
transition the recovery of 
central neural controls permitting not 
only ingestion but ingestion with reg 
ulation. Anorexia gives way to hun- 
ger and prandial drinking to true 
thirst. This supports the commonly 
accepted view that feeding and drink- 
ing are both under multiple control 
(Brobeck, 1960; Stellar, 1954). Be 
cause all controls may operate simul- 
taneously in the normal animal, it has 
been difficult to identify those essential 
for regulation. In the lateral hypo 
thalamic syndrome, the reappearance 


involves 


of these controls is separated in time 
during recovery. They are not all 
operating simultaneously. Controls that 
operate in the later stages of recovery 
It should 
be possible to identify both the con 


are absent in earlier stages. 


trols that are present and those that 
are absent during any particular stage 
by subjecting the animal in that stage 
to the 


fee ling 


variables known to influence 


and drinking in the normal 


animal. If we find controls that are 


absent before regulation, and which 
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reappear with it, we may then be able 
to specify the controls necessary for 
regulation. 
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A STATISTICAL THEORY OF DYNAMIC 
CONTOUR PERCEPTION ° 


W. M. Smita 
Dartmouth College 


Bartley’s (1941) statement made al- 
that the 
formation is 


most twenty years ago, “ 
matter of contour 
of the central phenomena of sensory 


one 


psychology and nerve physiology” (p 


334), certainly is as true today as it 


Indeed, the 
in which borders are perceived 


was when he wrote it. 
way 
is a fundamental problem in visual 
science. 

It is very clear that the classical 
idea of the retinal mosaic as a system 
of insulated meet 
adequately various facts of visual acu- 
ity, brightness discrimination, and 
contour definition. Visual acuity is 
greater than the classical notion im 


receptors cannot 


plies, and contour perception is not 
dependent upon two parallel arrays of 
receptors, array stimulated, the 
other Inadequate consideration 
of postretinal structures has been the 


one 
not. 


major failing of what is now consid- 
ered to be the classical view of visual 
acuity. 

The Marshall and Talbot 
(1942) is a formulation of a theory of 
visual acuity and contour resolution 
in terms of the structure and the func- 
tion of retinal and postretinal neural 
elements. Drawing upon recent find- 
ings and concepts of neurophysiology, 


we rk ( of 


they view such basic processes of acu- 
itv and contour perception as depend- 
ent upon the interrelated activity of 
populations of receptors and neural 
elements. Appropriately, their formu- 
lation now is referred to as a statis 
tical theory of visual function. 
\lthough we shall not present a re- 
This 
National 


work has been supported by the 


Science Foundation 


W. L. 


University of Delaware 
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view of the Marshall and Talbot 
work, for expository purposes later it 
will be helpful to mention some gen- 
eral features of their theory here. 
Their analysis rests heavily upon the 
neurological reciprocal 
overlap, lateral and vertical summa- 
tion in the visual system, physiological 
nystagmus, and the nature of the neu- 
ral recovery cycle. Especially impor- 
tant is the fact that their theory places 
the locus of the critical aspects of con- 
tour perception in the visual cortex 
rather than at the retinal level. 


concepts of 


In their original work Marshall and 
Talbot were concerned primarily with 
the general question of sensory acuity, 
but in recent other in- 
vestigations have extended the statis- 
tical theory to other aspects of vision. 
Osgood and Heyer (1952), for ex- 
ample, have applied it to the analysis 
and interpretation of figural afteref- 
fects and have claimed the statistical 
theory to be superior to the field con- 
ception offered by Kohler and Wal- 
lach (1944). Regarding figural after- 
effects, however, both the field and the 
statistical interpretations have had 
their critics (Deutch, 1956; Smith, 
1952). More recently, Day (1956, 
1957) has used statistical theory in 
the interpretation of phenomenological 
reports and experimental data con- 
cerning form perception. In particu- 
lar, he has employed statistical theory 
the the 
emergence of form in the peripheral 
data from 
area-perimeter 
ratios in form perception. Finally, 
Osgood (1953, pp. 245-47) has pre- 


more years 


in considering question of 


retina and in interpreting 


studies of area and 
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sented a brief account of how statis- 
tical theory can be applied to the 
problem of apparent movement. 

It is with the perception of move- 
ment that the present 
cerned. We shall attempt to extend 
statistical theory to a treatment of 
First, we shall 


paper is con 


perceived movement. 
describe briefly the basic experimental 
situation. shall summarize 
the results from which the theoretical 
Finally, we 


Then we 


treatment was developed. 
shall present an account of our modi- 
fication and statistical 
theory with supporting evidence. 


extension of 


DYNAMIC CONTOUR PERCEPTION 


For present purposes we define “‘dy- 
namic contour perception” as the per- 
ception of the borders of moving stim- 
uli. The definition could be modified 
to include instances wherein the ob- 
server moves rather than the stimulus. 
We shall 
tion here. 

In two previous papers (Smith & 
Gulick, 1956, 1957) data have shown 
that the perception of the borders of 


not consider this modifica- 


moving stimuli is influenced not only 
by the velocity of the stimulus, but also 
by the length of time the stimulus is 
observed in a fixed position prior to 
movement. The 
periments indicated that the border of 
a small square 
clearly at velocities exceeding about 


results of these ex- 


could not be seen 
13 degrees of visual angle per second. 
However, when the stimulus was pre- 
sented in a fixed position prior to 
movement, the maximum velocity at 
which the edges could be perceived 
two or 


was increased by a factor or 


more in most instances. A systematic 
study of the combinations of velocity 
and initial exposures needed to main 
tain contour revealed a linear relation 
ship up to exposures of about 350 
milliseconds and velocities of about 30 
visual angle second 


degrees of per 
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For every increase in velocity that pro- 
duced loss of contour, an increment in 
the duration of the initial exposure 
could be utilized to regain it. In more 
recent unpublished studies in which 
the duration of exposure of the initial 


stationary phase of the stimulus has 


been extended to 1 second, we find 
that the ability to see an edge with 
increasing velocities of movement is 
related curvilinearly to the duration 
of the initial stationary phase. At 
velocities exceeding about 40 degrees 
per second it appears that the fixed 
phase of presentation, regardless of its 
duration, will not 
a moving edge clearly. In 
paper (Smith & Gulick, 1957) we of- 
fered a preliminary statement of sta 
tistical theory as applied to the com- 
bined effects of the initial exposure of 
the stimulus in the fixed position and 
the movement phase. In the present 
paper we wish to expand this pre- 
liminary theoretical treatment. 


enable one to see 


an earlier 


BACKGROUND OF THE 


MopEL 


HISTORICAL 


When an observer’s total visual 
field consists of a light-dark pattern 
divided vertically, there is no apparent 
gradient of gray light 
dark. The discontinuity is clearly de 


fined 


where meets 
How the visual system is able 
to “construct” an edge from such a 
distribution of light striking 
receptors is not under- 
However, it is known that the 
reflected light 
a stimu- 


energy 
visual well 
stood. 
abrupt gradient of 
which exists at the border of 
lus object is not present once the 
pattern of light 
(Fry, 1955). The abrupt gradient be 
comes less abrupt so that the intensity 


reaches the retina 


of light comprising the geometric 
image of a border at the 
proximates the integral of a normal 


It is this pattern of light 


retina ap- 


distribution. 
which courses back and forth 


energy 
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Physical 
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Excitation 
(impulses / sec) 





Retinal space 


Fic. 1. 
to cortical 
formed to 
the cortical 
correlate « 


Hypothetical distributions illustrat 
space ( The 
leptokurti 
d, the distribution 
yf perceived contour.) 


platykurtic 
distribution 
of 


ment 


a more of 


1 1 
ievel, exe 


across population of 
the 
ments. 

If, instead 
observer 


a receptors as 


eye undergoes nystagmic move 


of single border, an 
a pattern 
parallel borders, such as a bright, thin 
line threshold 
the distribution of light of this pattern 
on the retina approximates a normal 
As the angular width of such 
pattern increased, the symmetrical 
distribution at the retina be 
changed progressively, becoming flat 


a 


shown of two 


1S 


vertical (near value ), 


curve. 
a 
would 
tened on top. Each border, however, 
would be represented on the retina by 
a distribution of light similar to that 
for the discontinuity of the vertically 
divided field mentioned previously. 
It is the central proposition of sta- 
tistical theory that the cortical repre- 
ot 
symmetrical distribution of excitation 
in Area 17. 
tion 


sentation a perceived edge is a 


The leptokurtic distribu- 


of excitation is correlated with 


the abrupt change in intensity at the 


stimulus boundary, but only by virtue 


Stimulus gradient 


Contour PERCEPTION 


b 


Intensity at 
fovea 


Physical 
intensity 


' 
Retinal space 





Excitation 
(impulses/ sec) 





Cortical space 


ing transformation of stimulus from environ 


dis 


the fovea, b, is trans 
system, c. At 
the direct 


of light at 
within the afferent 
peaked further 


tribution 
excitatior 


itation is and is 


of a very complicated series of events 
and mechanisms intervening between 
The abrupt change 
level in the stimulus is not 
at the 
introduced 


cornea and cortex. 
of 
re] re »duced 


energy 


retina because of 


distortion by 
through ocular media. Slight irreg- 
ularities of the of the 
errors of refraction and of aberration, 
and entopic stray light all tend to re- 
duce the abruptness of the stimulus 
gradient it 1 the 


passage 


surface eye, 


as is projected on 
retina. 

In Figure 1 we have attempted to 
show schematically the transformation 
from stimulus gradient to cortical dis- 
tribution. <A the visual field 
presents an abrupt gradient of energy. 
At the retina the distribution of energy 
assumes the platykurtic form as shown 
in Figure 1b. The relative flatness of 
this distribution is due to the scatter- 
ing of light by the distorting factors 
noted previously, plus the effects of 
This 
tribution of light represents the prox- 


line in 


physiological nystagmus. dlis- 
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imal stimulus which triggers the pri 
mwaywry receptors. 
different retinal 
distribution 


of neural firing fot 


loci, then a of excitation 
results. This hypothetical distribution 
is shown in Figure le. The peaking 
of this distribution is produced by the 
combined effects of numerous mecha- 
nisms. First, neural elements located 
near the center of the distribution in 
Figure 1b are stimulated by the most 
intense light. Accordingly, the num 
ber of fibers stimulated as well as the 
average rate of firing of these neurons 
is greater than in those neural elements 
located near the edges of the distri 
bution. Second, as a result of physio 
logical nystagmus, centrally located re- 
ceptors are stimulated more efficiently 
because the rate of application of the 
maximized in 
located near the 
slope of the distribution of 
(Rushton, 1935). If we 
primary “carrier” of contour activity 
to be the on-off fibers, then these fibers 
serving centrally 
will have a higher rate of firing than 
those found at the 
periphery of the population of recep- 
Third, because of the 
faster rate of application of stimula- 
tion just noted, the amount of neural 
centermost 
Neural ac- 
Nernst 
a counter-process of some 
cuts short 
As the rate 
of application of a stimulus is in- 
amount of accommoda- 
(Monnier, 1934). 
accommodation ac 


light energy is those 


receptors maximum 
light 


assume the 


receptors located 


serving receptors 


tors involved. 


accommodation in_ these 
fibers will be minimized. 
commodation, according to 
(1908). is 
kind in the which 


the effect of a stimulus. 


neuron 


creased, the 
tion is decreased 
Whether neural 
tually operates in the vertebrate retina 
remains to be demonstrated conclu- 
sively, but there is some evidence to 
suggest that it does (Granit & Skog 
lund, 1943). The result of the in- 
creased rate of application of the stim- 
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ulus, and hence decreased accommo 
dation, produces an additional relative 
increase in excitation rate for the neu 
rons associated with the center of ac 
tivity in the retina (Skoglund, 1942). 

There are, in addition, possible bases 
for peaking the excitation through the 
action of retinal receptive fields ( Bar 
low, 1953a, 1953b; Hartline, 1940; 
Kuffler, 1953). <A receptive field of 
a particular ganglion cell typically is 
organized with antagonistic discharge 
patterns of the center and the periph 
that is, the center may be pre- 
dominantly “on” and the periphery 
of the field “off,” or vice versa. The 
intermediate area of the field will be 
Thus, any displacement of 
a border across such a field as might 
be produced by slight movements of 
the eye will result in continuous ex- 
citation of the ganglion cell. If the 
field is an ‘off-center” type and the 
stimulation from 
center to periphery then there ‘iil be 
bursts of excitation on leaving center, 
entering and leaving the intermediate 
area, and upon entering the peripheral 
part of the field. Any kind of scanning 
and forth this field ob- 
will maximize excitation of 
the ganglion cell associated with the 
If to these complex 


ery, 


“on-off.” 


gradient of passes 


back across 


viously 


receptive field. 
events occurring at the retinal level 
we add the effects of reciprocal over- 


lap occurring during projection, the 
distribution of excitation in the visual 
cortex is peaked further and we have 
the leptokurtic distribution as shown 
in Figure 1d. 

The phenomenon. 
periments, a stimulus (4) 
moving horizontally 
distance (D), from left to right. Its 
velocity (V) was increased until the 
edges of A no longer could be seen 
clearly. At this velocity, if 4 
exposed briefly in a fixed position prior 
(this duration we called 


In our early ex- 
was ob- 


served through 


were 


to movement 
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ih: 4 


This effect was systematically 


was seen with sharp edges 
nves 
tigated for velocities from 0 to approx 
imately 30 degrees of visual angle per 
second. When of V 
plotted against values of t, a 
relationship was obtained. In 
to maintain the contour of a moving 


stimulus, each increase in I’ of | 


values were 
linear 


order 


de- 


gree visual angle per second required 


a corresponding increase in t of ap- 
proximately 25 milliseconds. 

The addition of the prior stationary 
phase to movement definitely 
tated the perception of the edges of the 
moving stimulus. Stemming from a 
statistical treatment of apparent move 
ment 1953, pp. 245-47), 
wherein two separate distributions of 


facili- 


(Osgor d, 


excitation in the cortex sum into one 
distribution whose peak is displaced 
in cortical space, a tentative explana- 
tion of our contour data was offered. 
that 


moving 


The hypothesis was excitation 
arising from the 
summed with the 
around c, (the cortical locus of 
imum activity produced by the fixed 
stimulus) to generate a single peaked 
distribution moving through cortical 
space toward (the cortical 
locus of maximum activity produced 
by the moving stimulus at the termina- 


stimulus 
activity 
max 


decaying 


locus c, 


tion of movement). 


THE MopeEL 


It is now appropriate to state some 
assumptions upon which our statistical 
concept of dynamic contour is based. 
of assumptions follow 
closely those made by other statistical 


Some these 


theorists; others are especially ger 
mane to the question of dynamic con 
tour. 


1. On-off and off receptors are the 
principal retinal mediators of contour 
perception. The excitation 
from the stimulation of these two types 


ensuing 
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of receptors becomes the cortical rep 
border. 
their response characteristics it is as 


resentation of a Because of 
sumed that on fibers play a minor role. 
2. Each an infinite number of 

border of a stimulus 
represented in the visual 
normal or 
distribution of excitation (rate of firing 


of 
points along a 
(hairline) is 
cortex by a near-normal 
of neural elements extended over cor- 
tical space ). 

3. The continuum of points along 
the border of the stimulus repre- 
sented in the cortex by a continuum 
of points of maximal excitation, the 
longitudinal extent of which is propor- 
tional to the length of the border. 

+. The mean slope of the cortical 
excitation distribution, rather than the 
absolute amount of excitation, defines 
The sharpness 
of the edge is determined directly by 
the relationship between the amount of 
excitation at maximum amd the 
of of excitation 
through cortical space. It is assumed 
that the mean slope of the distribution 
of excitation increases through time 
in a negatively accelerated 
Only when the slope equals or exceeds 
critical value range contour 
perceived. This implies some finite 
period of time as a necessary antece- 
dent to contour perception. 

The slope of the excitation distribu- 
tion in the cortex at any instant in time 
is determined by the intensity of light, 
contrast, and by the state of the eye 
and proximal neural elements. 

5. Overlapping distributions of ex- 
citation in the cortex summate. 

6. The mean slope of the distribution 

excitation also influenced 
movement: slope is 
versely related to velocity. 

7. After of stimulation 
arising from a fixed or moving border, 
the mean slope of the excitation dis- 
tribution decreases through time in a 


is 


the clarity of an edge. 


its 


amount dispersion 


fashion. 


some is 


of is 


velocity of 


by 
ini- 


cessation 
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Fic. 2. Hypothetical curve depicting the 
distribution through time. (S, 
tion of contour. 
conditions. 
stimulation. ) 
negatively accelerated fashion. The 
mean slope at any given instant in time 
during decay is influenced by the in- 
tensity of the light: the higher the 
intensity, the steeper the slope. 

When light stimulates the retina, 
the level of cortical excitation related 
to it grows in a negatively accelerated 
manner. Moreover, and of more im- 
portance, the excitation distribution 
representing a contour 
mean slope through time in a nega- 
This in- 


increases in 


tively accelerated fashion. 
crease in slope is shown in Figure 2. 

This hypothetical function indicates 
that in approximately 50 milliseconds 
the slope (S) reaches a value neces- 
sary for sharp 
(S.). Although this temporal value is 
to some extent arbitrary, it illustrates 
here that the sharp 
edges in the visual field takes a finite 


contour perception 


appearance of 


critical slope 
The slope of the function in the figure would change with various stimulus 
In this example, contour is perceived about 50 milliseconds after initiation of 


increase in the mean slope of the excitation 


signifies the value necessary for the percep 


period of time to develop. There is 
no single instant when a sharp edge 
Rather, the transition from 
“fuzziness” to a clearly defined edge 
is a continuous matter. The value of 
50 milliseconds in Figure 2 represents 
a general threshold estimate for the 
edge. 
influence the 
attained, 


appears. 


appearance of a_ well-defined 
Numerous 
rapidity with 
among which are intensity level, de- 
gree of contrast, and level of retinal 
adaptation. Regardless of specific 
conditions of observation, however, the 
delay in reaching S, after the onset 
of stimulation is due to the temporally 
dispersing factors of synaptic delay, 
reverberation, and horizontal spread 
of neural activity as it proceeds from 
the receptors to the visual cortex. 
When a border is removed from 
the visual field, its related excitation 
distribution decays with negative ac- 


factors 
which S,. ts 


can 
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celeration. 
this distribution also decreases and in 
( Assump- 


Furthermore, the slope of 


the same general manner 
tion 7). 
thetical decay function showing the de- 
crease in mean slope of the excitation 
distribution function of time. 
Note that this decay function is influ- 
enced to an important degree by the 
of the stimulus its in- 
The prolonged decline in the 


Figure 3 represents a hypo- 


as a 


duration and 
tensity. 
mean slope is due to the continued ex- 
citation as a of the reluctance 
of the on-off off fibers the 
optic nerve to cease activity. The 
nature of this decay is based primarily 
upon evidence obtained by Hartline 
(1938) of residual activity persisting 
for many seconds, although gradually 
Because the slope during 
for some brief but 


result 


and in 


subsiding. 
decay exceeds 5, 
finite period of time after the cessation 
of stimulation, an edge can be per- 
ceived during this brief interval. 


excitation 


oO 
N 
oO 


Mean slope of distribution of 
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The hypothetical functions depicted 
in Figures 2 and 3 represent the in- 
crease in mean slope as a function of 
time produced by an initial stationary 
stimulus (Figure 2) and the decrease 
in mean slope as a function of time 
produced by the cessation of the same 
stimulus (Figure 3). When a moving 
stimulus is presented to the eye, both 
the mean slope of excitation and the 
level of excitation increase in the same 
general manner as that already dis- 
cussed for stationary stimuli. In Fig- 
ure 4 are shown three hypothetical 
functions describing the mean slope, 
for each of three velocities of 
movement. Assuming other condi- 
tions constant, they illustrate that the 
time required to attain S, is deter- 
mined by velocity of movement. In 
Assumption 6 it was stated that the 
mean slope of the distribution of ex- 
citation produced by a stimulus in mo- 
tion was affected by the velocity of 


one 
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Hypothetical curve of decay in mean slope of excitation distribution through 
(In this example, contour is perceived for a few milliseconds after cessation of 


stimulation until the slope of the excitation distribution falls below S.-.) 
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Three 


of movement increases, more 
necessary for contour perception 
movement. At low velocities (V < 
10°/sec) the time required to attain 
S. is approximately equal to that re- 
quired to attain it with a stationary 
stimulus (Curve a). However, at 
higher velocities (V > 10°/sec) more 
time is required before S, is attained 
(Curve b). Indeed, if V is sufficiently 
high, S. will never be reached (Curve 
c). As illustrated with these hypo- 
thetical functions, when velocities are 
very high (Curve c) the mean slope of 
the excitation distribution never 
reaches the critical value, S.. There- 
fore, clearly defined edges cannot be 
perceived. It is under these 
tions that the presence of the initial 
stationary phase of the stimulation se- 
quence becomes useful. 

What is now required is to elucidate 
the manner in which the excitation 
arising from the stationary stimulus 
sums with the excitation produced by 


condi- 


showing 
excitation distribution changes over time for different velocities of movement. 
time is required for the mean slope to reach Se, the level 
At high velocities S, 


slope of the 
(As velocity 


how the increase in mean 


may not be attained—Curve c.) 
the subsequent moving stimulus so 
as to maintain jointly a value of S, 
which is sufficient for contour percep- 
tion. It is our hypothesis that the 
maintenance of contour during stim- 
ulus movements as a result of the 
presentation of the initial stationary 
phase is achieved by the summation 
of the distributions of excitation aris- 
ing from the total stimulation se- 
quence. Excitation resulting from the 
moving stimulus is added to the resid- 
ual remaining after the withdrawal of 
the stationary stimulus, and their sum 
under certain conditions can be suf- 
ficient to increase the mean slope of 
the combined excitation distribution 
to equal or surpass S, at a time much 
earlier than would otherwise have ob- 
tained had S, been attained solely on 
the basis of excitation from movement. 

Even under conditions of static con- 
tour perception a brief interval is re- 
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Cortical space 


Fic. 5. Theoretical curves showing how the combined distributions of excitation for 
the moving contour (regular broken lines) and residual and decaying excitation from the 
initial fixed exposure of the stimulus (dot-dash lines) combine (solid curves) over time 
to produce contour perception. (As time increases from the moment of the initiation of 
movement to 20 milliseconds, a; 40 milliseconds, b; 60 milliseconds, c; etc., the level of 
excitation related to the moving border increases as it moves through cortical space. The 
level of excitation of the combined distribution also increases through time. Here it may 
be seen that the slope of the distribution representing the moving stimulus, Sm, increases 
through time. The slope of the combined distribution, Seom, increases through time at a 
faster rate, and thereby hastens the attainment of the critical slope, S., for contour resolution. 
The symbol C; represents the cortical location representing the peak of excitation from 
the stationary phase, and C, the cortical location of the peak of excitation from the moving 


phase at the end of movement. The abscissa represents cortical space in arbitrary units.) 
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quired before S, is reached because of 
the temporally 
ready mentioned. 
ably increases slightly when a stimulus 


dispersing factors al- 
This interval prob- 
moves slowly; but when velocity ex- 
ceeds 10 degrees or so per second, the 
Within 
the range of velocities for which the 
initial fixed stimulus is required in 
order to maintain contour, larger and 
larger residuals of excitation must be 


interval increases markedly. 


added to the excitation produced by 
movement if contour is to be present. 
The result of this 
increase the slope of the combined ex 


summation is to 


citation distribution, and hence hasten 
the attainment of S.. 

This illustrated for 
increasing temporal intervals in Figure 


summation is 
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When the velocity of movement 
stimulus is constant, the rate of dis- 
placement of the peak of excitation in the 
cortex is also constant (solid line). How- 
ever, when the decaying activity from the 
fixed stimulus sums with the growth of 
activity produced by the moving stimulus, 
the peak of the combined distribution is not 
displaced through cortical space at a 
stant rate (dashed line). 
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5. Note in Figure 5 that the excita- 
tion distribution for the fixed stimulus 
(dot-dash line) decays through time, 
thereby producing a distribution of 
Note 
that the slope of the excitation distribu- 
tion representing the moving stimulus 
(dashed line) increases through time, 
as does the combined distribution 
(solid line). The most important 
point illustrated by the set of curves, 
however, is that the mean slope of 
the combined distribution increases at 
a faster rate than does the mean slope 
of the excitation distribution for move- 
ment. This means that the perception 
of the edge of the moving stimulus is 
facilitated and can be seen at an earlier 
point in time than would be the case 
had there not been the contribution to 
excitation by the decay of the stimula 
from the fixed stimulus. 

Figure 5 demonstrates another im- 
portant aspect of this summation proc- 
According to the Marshall and 
Talbot (1942) view, borders of a stim- 
ulus are represented by the peaks of 
(distributions standing for points along 
the borders or in their terms, a ridge 
of excitation is produced by an infinite 
number of points along the edge of a 
stimulus. For static contour percep- 
tion the location of the stimulus edge 
in the visual field and its cortical loca- 
tion bear a rather straightforward re- 
lationship. In dynamic contour per- 
ception, however, this relationship be- 


progressively less slope. also 


tion 


ess. 


comes more complex. If we assume 
the eye to be fixed, when the velocity 
of movement of the stimulus object is 
constant, so is the velocity of the mov- 
ing peak of excitation as it travels 
through cortical space. In dynamic 
contour perception upon the 
summation of excitation distributions 
the peak of total excitation does not 
travel through cortical space at a con- 
stant velocity. In Figure 5d the peak 
of the combined distribution can be 


based 
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seen to lag behind the peak produced 
by the moving border alone (by about 
one unit of cortical space). The dif- 
ference in the rates of displacement 
for the peak of the distribution of 
excitation produced by the moving 
border and the peak of the summed 
distribution of excitation are presented 
in Figure 6. This figure illustrates 
that the rate of displacement of the 
peak of excitation for the moving stim- 
ulus alone is constant; hence the 
straight line. However, the rate of 
displacement of the peak of excitation 
for the combined distribution is not 
constant, but first negatively accele- 
rated and then positively accelerated. 
It is hypothesized that this change in 
rate of displacement of the peak of 
the distribution of the summed ex- 
citations is responsible for a peculiar 
effect reported by our subjects; 
namely, that the velocity of the moving 
edge did not appear to be constant. 
This account of variable 
velocity with constant velocity of 
movement we believe to be one 
portant piece of supporting evidence 
for the theory presented here. 
For low 
is attained with suf- 


perceived 


im- 


To summarize: velocities 


of movement S, 


ficient speed to allow virtually instan- 


taneous With 
high attained 
during the time interval required to 
complete the stimulus movement se- 
quence. For the intermediate range of 
velocities (8-30°/sec), S. is attained 
rapidly enough to permit contour per- 
ception only if the decaying neural ac- 
tivity produced by the fixed stimulus 
sums with the growth of neural ac- 
tivity produced by the moving stim- 
ulus. 


contour 
velocities S. is 


perception. 
never 


SUPPORTING EVIDENCE 

Lack of techniques for specific meas- 
ures of the integrative activity of the 
visual system makes difficult any di- 


‘ 
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rect test of statistical theory as ex- 
pressed here. However, indirect evi- 
dence of a supporting nature can be 
obtained from various perceptual ex- 
periments. We have mentioned pre- 
viously the evidence concerning the 
perceived variable velocity of a con- 
stant velocity stimulus. To other sup- 
porting evidence we now turn. 

Contrast. We have said that con- 
tour perception cannot occur unless 
the slope of the cortical distribution 
representing contour reaches some 
minimal value (S.). One factor 
which should have a direct and im- 
portant bearing on the activity of on- 
off fibers, and hence the peaking of 
the related cortical distribution of ex- 
citation, is the intensity difference rep- 
resenting a border. It was hypoth- 
esized, therefore, that under conditions 
of high contrast between a stimulus 
object and its surround, the perception 
of a moving edge could occur at veloci- 
ties of movement higher than those 
which would be expected under low 
contrast. Because high contrast would 
decrease the time required to obtain 
S,., the importance of the summative 
effects of decaying neural activity fol- 
lowing the presentation of the initial 
stationary stimulus would be mini- 
mized. That is, a stimulus of high 
contrast at a given velocity should re- 
quire less contribution from the de- 
caying effects of the initial stationary 
stimulus than a stimulus of the same 
velocity and lower contrast. The ma- 
jor contribution of the initial stationary 
stimulus is to provide, after its cessa- 
tion, neural excitation which may sum 
with that produced by the moving 
stimulus. This summation effect sim- 
ply hastens the attainment of S, during 
movement (see Figure 5). 

In terms of our operations the result 
of an increase in contrast would be to 
allow the perception of contour of 
the moving stimulus at a_ higher 
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The effects of contrast and illu- 
level on contour perception of 
moving stimuli. (Data of five subjects. 
Brightness values in footlamberts of the 
background and the stimulus: d, 4.2-1.2; 
c, 4.2-1.5; b, .036-.007; a, .036-.015. The 
stimulus was always darker than the back- 
ground. Stimulus size equaled 0.5 degrees, 
and the extent of movement was 5.0 degrees 
of visual angle. Each curve in the figure 
specifies how long the stimulus had to ap- 
pear in a fixed position prior to movement, 
ti, in order for contour to be perceived dur- 
ing movement of a particular velocity.) 


Fic. 7. 


mination 


velocity than under conditions of low 
contrast, even though the duration of 
the fixed stimulus remained constant. 
Conversely, for a given velocity of 
movement, a reduced exposure interval 
of the initial fixed stimulus is adequate 
to maintain contour perception during 


movement whenever contrast is in- 
creased. 

Data on the effects of contrast are 
presented in Figure 7. For each of 
five subjects, velocity of movement 
was increased from 8 degrees per 
second in 2-degree steps until contour 
was lost (about 13°/sec). There- 
after, the value of t, required to see 
contour was determined for each in- 
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crease in V, In Figure 7 mean values 
of t, are plotted against V for each of 
two contrast conditions at each of two 
levels of illumination. These data 
show that increased contrast selectively 
displaces the time-velocity contour 
function in the expected direction: 
higher contrast enables the subject to 
perceive the edge of the moving stim- 
ulus at higher velocities with a con- 
stant value of t,. Under low illumina- 
tion the increase in contrast shifted 
the function from a to b, and under 
high illumination, the increase in con- 
trast shifted the function from c to d. 

Intensity level. The data plotted in 
Figure 7 also show an effect one 
would predict from statistical theory ; 
namely, that contour perception of 
stimuli in motion is influenced by in- 
tensity. In Assumption 7 we stated 
that the mean slope of the decaying 
distribution of excitation became 
steeper as stimulus intensity increased. 
Furthermore, the rise in the mean 
slope of the distribution of excitation 
from the moving stimulus with in- 
creased intensity will be faster and will 
hasten the attainment of S,. Accord- 
ingly, the importance of the summative 
effects of the initial stationary stim- 
ulus would be reduced. 

As seen in Figure 7, the result of 
increasing intensity shifted the time- 
velocity contour function in the ex- 
pected direction; under low contrast, 
increased intensity shifted the function 
from a to c, and under high contrast 
an increase in intensity shifted the 
function from b to d. 

Maintaining a constant contrast be- 
tween the stimulus and its surround 
while increasing intensity should and 
does facilitate contour perception, but 
only up to a point. If adjacent areas 
of a visual pattern differing in in- 
tensity have their intensities increased 
proportionately, a level will be reached 
where the necessary conditions for sta- 
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tistical peaking will no longer be pos- 
sible. Thus if the areas defining a 
contour are both sufficiently intense, 
on-off receptors would not respond 
and the presumed normal distribution 
of excitation would degenerate. 

Spatial displacement. Our account 
of how decaying neural activity from 
fixed stimulation combines with that 
produced by a moving stimulus re- 
quires particular spatial and temporal 
relationships of these two parts of the 
stimulation sequence. Any change 
which destroys the spatial and tem- 
poral contiguity of the two phases 
should have an effect upon 
contour formation. 

We predicted that the introduction 
of a spatial separation in the visual 
field between the position of the fixed 
stimulus and the initial position of the 
moving stimulus would require a 


adverse 


higher value of t, to allow contour 
perception 


than 
would be required in a control condi- 
tion without spatial separation. The 
rationale for the separation in the 
visual field of the two phases of the 
stimulation sequence was to separate 
the assumed distributions of excitation 
produced by them in the visual cortex. 
It was necessary to provide the sub- 
ject with a minute fixation point lo- 
cated at the place where movement 
began, because without it the subject 
probably would have tended to view 
the entire stimulus sequence with the 
same area of the retina. Following the 
cessation of the stationary stimulus, the 
moving stimulus appeared at the point 
of fixation and moved from left to 
right. 

If the neural elements involved in 
the excitation produced by a stationary 
stimulus comprised a totally different 
population of elements from those in- 
excitation from a moving 
then no summation could 

On the other hand, max- 


during movement 


volved in 
stimulus, 
take place. 
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imum summation would occur when 
identical elements comprised both pop- 
ulations. It was predicted that the 
greater the spatial displacement (the 
less the summation), the more adverse 
would be the effect upon contour 
formation. 

Data obtained confirmed the spatial 
displacement hypothesis. Figure 8 
presents the time-velocity functions ob- 
tained under several experimental con- 
ditions. These data are from eight 
subjects, and the procedure employed 
in obtaining the data was the same as 
that indicated previously.  (Differ- 
ences in the origin of the time-velocity 
functions which exist from one experi- 
ment to another are assumed to be 
due to individual differences, and dif- 
ferences in illumination and contrast.) 
The control condition consisted of hav- 
ing the stationary stimulus appear at 
the same place in the visual field from 
which movement began. Displace- 
ments were measured in degrees of 
visual angle at 1 and 2 degrees right, 
and 1 and 2 degrees left. Displace- 
ments of equal extent to the right and 
left produced equivalent shifts in the 
functions and so data were combined 
across this variable. 

Temporal displacement. If there 
were a time lapse between the cessa- 
tion of the initial stationary stimulus 
and the onset of the moving one, then 
subsequent summation or neural ac- 
tivity should be reduced. This reduc- 
tion in summation would increase the 
time required to obtain S, and would 
minimize the influence of the stationary 
stimulus upon the perception of con- 
tour of the moving phase. 

Judgments of contour relating to 
this issue were obtained for eight sub- 
jects under each of three conditions. 
In the control condition the two phases 
of the stimulus tem 
porally contiguous, where in the other 
two conditions temporal delays of 40 


sequence were 
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Fic. 8. The effects of spatial separation 
of the two phases of stimulation on dynamic 
contour. (Combined data of eight subjects. 
Stimulus brightness, 2.0  footlamberts; 
brightness of background, 70.0 footlamberts. 
Stimulus size, 0.5 degree. Extent of move- 
ment, 5 degrees of visual angle. Increasing 
the spatial separation between fixed phase 
of stimulation and moving phase required 
a greater exposure duration of the fixed 
phase in order to retain contour perception 
for a particular velocity of movement. Dis- 
placements to the right and left produced 
equivalent effects. Therefore, data were 
combined across this parameter.) 


and 80 millseconds were employed. 
We predicted that the temporal delays 
would shift the time-velocity functions 
to the left indicating, that for a par- 
ticular velocity, longer exposures of 
the stationary stimulus would be re- 
quired to produce contour formation. 
The amount of neural activity available 
to sum with that produced by the 
moving stimulus is determined not 
only by temporal displacement of the 
two phases of the stimulus sequence, 
but by the duration of the stationary 
stimulus as well. Accordingly, we 
hoped to obtain data suggesting that 
what is lost to the summative process 
as a result of temporal delay can be re 
covered by increasing the value of t,. 
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Data relevant to this problem are 
presented in Figure 9. The shift in 
the contour function as a result of tem- 
poral displacement is apparent. The 
greater the delay, the larger the shift 
to the left from the control condition. 
Presumably, a delay longer than 80 
milliseconds might produce a_time- 
velocity function asymptotic to the 
ordinate, suggesting that beyond a cer- 
tain duration of temporal displacement, 
increased durations of t, would cease 
to influence the perception of contour. 
Increased values of t, can compensate 
for temporal displacement provided 
that the latter is not too long. For 
example, at a velocity of 18 degrees 
per second the duration of the sta- 
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Fic. 9. The effects of temporal separa- 
tion of the two phases of stimulation on 
dynamic contour. (Combined data of eight 
subjects. Brightness of stimulus, 2.0 foot- 
lamberts; brightness of background, 70.0 
footlamberts. Stimulus size, 0.5 degree of 
visual angle. Extent of movement, 5 de- 
grees. For a given velocity of movement, 
the exposure duration of the fixed phase of 
stimulation must be progressively increased 
as temporal separation increases in order 
to retain contour.) 
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tionary stimulus necessary for contour 
perception during movement was 80 
milliseconds for the control condition 
(no temporal displacement). How- 
ever, with temporal displacements of 
40 to 80 milliseconds, the durations of 
the fixed phase had to be increased to 
approximately 100 and 120 millisec- 
onds, respectively. 


DISCUSSION 


The conception of contour percep- 
tion which we have offered, while con- 
gruent with certain facts concerning 
the visual system and with particular 
experimental results, has not directed 
itself to various considerations of a 
complicating and _ relevant nature. 
These considerations include the ques- 
tion of eye movement, the locus of the 
summation process, the question of 
neural inhibition, and some general 
questions about  neurophysiological 
mechanisms. 

Eye movements. The perception of 
edges of moving stimuli obviously 
raises the problem of how ocular track- 
ing is related to the formation of con- 
tours. Surprisingly little empirical 
knowledge is available regarding the 
nature of ocular tracking, although 
one of the authors is currently inves- 
tigating this problem. Having avail- 
able a photoelectric system of measur- 
ing eye movements which is coupled 
to a system for generating moving 
targets, precise measures can be made 
of ocular tracking and these then re- 
lated to perceptual judgments of tar- 
gets whose velocity, displacement, and 
other attributes can be varied (Smith 
& Warter, 1959, 1960). Investiga- 
tions to date indicate quite clearly that 
for the border of a moving target to 
be seen clearly a high degree of ac- 
curacy in visual tracking is not re- 
quired. In fact, it appears that it is 
only for the very slow velocities (con- 
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stant) of movement (about 10°/sec or 
less) that the eye tracks with reason- 
able accuracy. The upper limit of 10 
degrees per second, it is important to 
note, is approximately the velocity of 
movement at which the borders of 
the moving stimulus begin to be seen 
as blurred when the duration of the 
stimulus in a fixed position before 
movement (t,) is zero. It will be re- 
called that the most important single 
fact reported in our contour experi- 
ments is that by increasing the interval 
of exposure of the stimulus in a fixed 
position prior to movement we can 
systematically increase the upper limit 
of velocity at which the edge of the 
stimulus still can be seen clearly. A 
basic question raised by our experi- 
ments concerns how an increase in t, 
affects visual tracking. A provisional 


answer to this question based upon eye 
movement data to date is that increas- 
ing the magnitude of t, improves visual 
tracking generally but not to a level 


of accuracy characteristic of tracking at 
low velocities. About all we can con- 
clude at this point is that the facilita- 
tion of dynamic contour perception by 
exposure of the target in a fixed posi- 
tion prior to movement is in some 
manner directly related to the accuracy 
of visual tracking. We can be sure 
that in our contour experiments the 
displacement of the stimulus at a con- 
stant velocity did not produce on the 
retina a displaced locus of maximum 
excitation which also was displaced 
at a constant velocity. We must as- 
sume that the cortical representation 
of a contour must contain components 
of acceleration and deceleration not 
found in the original stimulus. This, 
of course, is due to irregularities in the 
pursuit movement of the eyes under 
conditions wherein the moving stimu- 
lus alone is presented, and it is due to 
inaccurate tracking and the summation 
process under conditions in which both 
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phases of the stimulation sequence are 
presented (see Figure 6). 

Based upon eye movement data we 
can offer a suggestion concerning the 
possible relation between dynamic con- 
tour perception and correlated visual 
tracking movements. If one examines 
the magnitudes of lag and lead errors 
of the eye as the velocity of a stimulus 
increases, it is obvious that the mag- 
nitude of these errors increases as 
velocity increases, and at any given 
velocity it decreases at t, is increased. 
It is our hypothesis that as long as 
the lag or lead error does not exceed 
a few degree of angle (3-4 degrees at 
most), contour perception can be 
maintained. If this hypothesis is sup- 
ported by the experiments now in 
progress, it will converge nicely with 
the facts about the maximum angular 
separation over which optional ap- 
parent movement can be obtained (De 
Silva, 1929), and the maximum dis- 
tance over which one contour can exert 
an inhibitory influence over another 
(Fry & Bartley, 1935). Moreover, 
it is very interesting to note, and per- 
haps very important that the figure of 
3-4 degrees angular magnitude is just 
about the visual field size represented 
by the measured size of receptive fields 
as revealed by Kuffler (1953). 

The data obtained so far on the 
problem of eye movements tend to 
complicate the model, but do not viti- 
ate it. Since the data on eye move- 
ments are at present incomplete we 
have assumed in the description of the 
summation process (see Figure 5) that 
the eye lags behind the target in the 
hypothetical situation depicted. If 
our eye movement data provide us 
with data contrary to this assumption, 
then the effect would be to augment 
the summation process by keeping the 
growth of neural excitation produced 
by the moving stimulus nearer the 
population of neural elements under- 
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going decay following the cessation 
of the stationary stimulus. Again, the 
important point is that the general 
statement of the model can accommo- 
date the complicating factor of eye 
movements. 

Locus of summation. Since the first 
statement of statistical theory by Mar- 
shall and Talbot (1942), they and sub- 
sequent statistical theorists have placed 
the locus of contour resolution in the 
striate cortex. The primary role of 
the projection system has been seen 
as contributing to the peaking of the 
distribution of neural excitation. The 
locus of summation of different dis- 
tributions of excitation, however, has 
been the visual cortex alone. The 
question may be raised, however, as 
to the possibility of summation at 
lower levels. In an effort to ascertain 
the likelihood of this possibility we 
executed an experiment in which the 
stationary stimulus was presented to 
one eye whereas the moving stimulus 
was presented to the contralateral eye 
(Gulick, 1960). The results of this 
experiment suggest that for maximum 
contour facilitation it is necessary to 
present the stimulation sequence in 
such a way as to allow neural inter- 
action and/or summation at the retinal 
level as well as at the cortical level. 
The implications of these data for the 
model involve a possible extension of 
the present view, of neural summation 
as it is conceived in current statistical 
theory. 

The 


subsequent 
neural summation) of two temporally 
and spatially separate visual events 
which is central to our position raises 
in addition certain questions which re- 


interaction (and 


quire further investigation. We have 
presented evidence concerning the ef- 
fects of separating in space and time 
the initial exposure of the stimulus in 
a fixed position and the movement 
component. It is necessary to ask 
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what effects upon the perception of 
edge of the moving stimulus would be 
observed if the fixed and moving com- 
ponents of presentation differed in 
terms of certain attributes which have 
yet to be studied. For example, sup- 
pose that the stationary stimulus prior 
to movement were exposed out of 
focus, to be followed by the moving 
with the stimulus in focus. 
What then would be the change in our 
typical relation between velocity and 
t, as far as perceiving the moving edge 
is concerned? Similar important ques- 
tions also need to be studied once we 
have completed the instrumentation 
to present stimuli in this 


phase 


necessary 
manner. 
Neural inhibition and facilitation. 
\s a final point of discussion we real- 
ize, as must the reader, that our con- 
ception of contour and its expression 
in neurological and physiological terms 
is unbalanced. The theory rests prin- 


cipally upon the concept of neurol sum- 
mation and says little, if anything, 
about related concepts of inhibition and 


facilitation. In view of the inhibitory 
effects that one contour can have upon 
an adjacent one when both are fixed 
in position (Fry & Bartley, 1935), we 
are acutely aware that similar effects 
may be present when a particular con- 
tour is displaced relative to its original 
position. We think it reasonable, for 
example, to assume that an inhibitory 
mechanism can account for the “damp- 
ing” of residual retinal activity corre- 
lated with the “trailing” 
moving stimulus. In any 
assume that future and more sophis- 
ticated expressions of statistical theory 
will incorporate modes of neural inter- 
action in addition to summation at the 
cortex. 


edge of any 
event we 


Regarding possible future expres- 
sions of statistical theory of contour 
we can indicate briefly some of the 
things we believe ought to be done. 
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Additional perceptual studies are 
needed in which systematic investiga- 
tions of the effects of differences in the 
fixed and moving components of the 
stimulus presentation are made. More- 
over, we need to know more about the 
relationships between tracking mo- 
tions of the eye and various conditions 
of stimulus movement. Finally, and 
most important, direct evidence bear- 
ing upon the neurological basis of 
perception is required if we are to 
continue to use with confidence the 
statistico-neurological assumption upon 
which the theory rests. While con- 
gruent with some neurological evi- 
dence, statistical theory of contour 
perception is far from being firmly 
based upon unambiguous neurological 
and physiological data. And, of 
course, the theory cannot find its real 
tests in perceptual data alone. We 
must continue to search for direct 
measures of the integrated activity of 
the visual system itself in relation to 
the data of perception. By so pro- 
ceeding we will determine if statistical 
theory will prove itself as a fruitful 
means of conceptualizing basic prob- 
lems in perception. 
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SUBJECTIVE PROBABILITIES INFERRED 
FROM DECISIONS! 


WARD EDWARDS 


Engineering Psychology Laboratory, University of Michigan 


Theories about how people make 
decisions in risky or uncertain situa- 
tions have come to focus on two con- 
cepts: utility, or subjective value, and 
subjective probability. The status 
of the utility concept is fairly clear 
(for reviews, see Edwards, 1954c, 
1961). But the status of the sub- 
jective probability concept is quite 
confused, both theoretically and in 
relation to experiments. Too often 
the concept of subjective probability 
has been introduced casually into a 
decision theory which focuses on the 
concept of utility. In fact, the only 
thorough treatment of a concept of 
subjective probability (quite different 
from the one to be discussed in this 
paper) is L. J. Savage’s (1954) bril- 
liant book. The notion of subjective 
probability would be in 
psychology even if it played no role 
in decision theories, since men often 
make judgments about probabilities 
based on displays of random or non- 
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random events. But this paper is 
mostly concerned with subjective 
probabilities inferred from decisions 
via some kind of value or utility 
maximization model of the decision- 
making process. 

Psychologists trained in psycho- 
physics tend to think that subjective 
probability is related to objective 
probability in more or less the same 
way that the subjective loudness of a 
tone is related to its objective in- 
tensity. The purpose of this paper is 
to examine the content, merits, and 
limitations of such an approach. The 
discussion will focus on two closely 
related matters. The first is the 
idea of a set of functions relating 
subjective to objective probability. 
The second is whether or not the sub- 
jective probabilities of a set of mu- 
tually exclusive events, one of which 
must happen, should add up to one. 


The paper begins by defining two 


of decision theories. After 
some preliminary discussion of utility 
and subjective probability functions, 
it next considers the class of theories 
which result when subjective proba- 
bilities are assumed to add up to one. 
This class turns out to have some 
serious difficulties. A brief review of 
experimental evidence provides em- 
pirical reasons for avoiding these 
difficulties by rejecting additivity. 
Then the paper examines models 
which do not require subjective proba- 
bilities to add up to one. Such models 
require a reformulation of the concept 
of utility; utility scales must have 
true zero points. Finally the paper 
derives two experimentally testable 


classes 
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properties which any plausible utility- 
subjective probability 
tive or 
presents a 
tractive. 


model, addi- 
must have, and 


which I find at- 


otherwise, 
model 


MAXIMIZATION AND UTILITY 


First, some definitions and assump- 
tions. This paper is concerned with 
decisions of the following kind. A 
decision maker can choose one and 
only one of two or more courses of 
action. Associated with each course 
of action is a finite set of possible 
outcomes, one and only one of which 
will happen if that course of action is 
chosen. Each possible outcome can 
be described by a number, called a 
utility, defined at least up to a linear 
transformation (i.e., measured at least 
on an interval scale); this number 
fully represents the desirability of that 
outcome maker. If 
more than one possible outcome is 
with a given course of 
action, then a well defined external 
event, over which the decision maker 
has no control, determines which 
outcome will occur if that course of 
action is Most 
decisions fit this description. 

The crucial assumption of the class 
of theories considered here is that 
people behave as if they were attempt- 
ing to maximize quantity 
perhaps not skillfully, perhaps not 
consistently, but still attempting to 
maximize. Maximize what? The 
following equation defines the con- 
cept of subjectively expected utility 
(SEU): 


to the decision 


associated 
nonsocial 


chosen. 


some 


[1] 


SEU = ¥ yun 


It refers to a course of action which 
has a number of possible outcomes. 
The i“ possible outcome has an objec- 
tive value to which corresponds a 
subjective value or utility «;. Out- 
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come 7 will occur if a given external 
event occurs; that event may have an 
objective probability, or the objective 
probabilities of one or more of the 
relevant set of external events may 
be undefined. In either case, out- 
come 7 has a subjective probability 
vy; The theories with which this 
paper is concerned simply assert that 
subjects (Ss) behave as though they 
choose, from among the courses of 
action open to them, the one which 
has the largest SEU. For more ex- 
tended discussion of the origin of 
these models, see Edwards (1954c, 
1955). This paper will treat them as 
models of what men do (descriptive 
models) rather than of what they 
should do (normative models). 

Equation 1 introduces formally the 
central concept of this paper: sub- 
jective probability. What is a sub- 
jective probability? It is a number 
between zero and one which describes 
a person's assessment of the likeliness 
of an event. Exploration of several 
more explicit definitions follows. 

This paper assumes that objective 
probabilities exist, are known for some 
events, and are distinct from subjec- 
tive probabilities, though subjective 
probabilities are related to objective 
Some theorists like Savage 
(1954) have asserted that there is no 
such thing as objective probability; 
all probabilities are subjective. This 
is no place to argue whether or not 
objective probabilities can be mean- 
ingfully defined. The argument is 
irrelevant. Some events, like the toss 
of a coin or the roll of a die, have well 
established ‘“‘conventional’”’ probabili- 
ties. Whether these probabilities are 


ones. 


objective or not, they can meaning- 
fully be compared with the subjective 
revealed when 


probabilities people 


make decisions which involve these 
events. 


Of course many events exist which 
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are neither certain nor impossible but 
for which no objective or conventional 
probability can be defined. Example: 
Within the next 24 hours, you will 
drink a glass of wine. Such events are 
called uncertain, while events for 
which objective probabilities can be 
defined are called risky. This paper 
will not consider uncertain events. 
The same decision theory ought to 
apply to decisions whose outcomes 
are contingent on uncertain events as 
to decisions whose outcomes are con- 
tingent on risky events, and the latter 
are more helpful in examining the 
nature of that theory. 

the notion of a set of functions 
relating subjective to objective prob- 
ability is vital to this paper. But the 
theorems proved below are relevant 
also to theories which deny or ignore 
the existence of such a relationship. 
They imply for such theories the 
existence of severe restrictions on per- 
missible relationships between sub- 


jective probabilities and classes of 
risky events (classes defined by ob- 
jective probability values). 

From here on, some abbreviations 


needed. The word ‘‘maximiza- 
tion” will be omitted from phrases 
like “the SEU maximization model.”’ 
The concept of objective probability 
will be symbolized OP in text; SP will 
stand for subjective probability. The 
concept of a set of events of which 
some one must happen and not more 
than one can happen—that is, the 
concept of an Exclusive, Exhaustive 
set of Events—will be symbolized 
EEE in text. Two classes of SEU 
models exist. Those in which the 
SPs of an EEE must add up to a 
constant will from here on be called 
ASEU (additive subjectively expected 
utility maximization) models. Those 
in which the SPs of an EEE do not 
need to add up to anything in particu- 
lar (though of course the OPs of an 
EEE must always add up to one) will 


are 
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be called NASEU (nonadditive sub- 
jectively expected utility maximiza- 
tion) models. 

The subject of transformations will 
come up. Psychologists speak of three 
classes of scales, defined by three 
transformation groups, which are 
relevant here. An interval scale 
permits the linear transformation 
x’ =ax+b6, a>O0. A ratio scale 
permits the scale transformation 
x’ = ax, a>O. An identity scale 
permits only the identity transforma- 
tion x’ = x. This paper will use the 
psychological rather than the mathe- 
matical names for these classes of 


scales. 


CLASSES OF SEU MOopELs 


This section will identify two main 
classes of SEU models, ASEU and 
NASEU, and will then examine some 
extraordinary restrictions on the util- 
ity functions which must enter into 
each. The difference between ASEU 
and NASEU models is defined by 

Theorem 1: If only interval, ratio, 
and identity scales are considered, in 
any SEU model in which the SPs of 
an EEE must add to a fixed constant 
(ASEU model) utility must be meas- 
urable at least on an interval scale 
and SP must be measurable at least 
on a ratio scale. In any SEU model 
in which no restrictions are put on the 
sum of the SPs of an EEE (NASEU 
model), utility and SP must both be 
measurable at least on ratio scales. 

Consider the NASEU models first. 
Since the transformations under con- 
sideration are successive restrictions 
of the general linear transformation, 
consider the least restrictive possi- 
bility first. In that case, the trans- 
formation u’ =au+b, a>O on 
utility and y = cp) + d,c > 0 0n SP 
are permissible. Consider two courses 
of action, A and B, which have equal 
SEUs. (Continuity assumptions not 
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spelled out here are necessary to guar- 
antee that such events exist and that 
they do not have certain pathological 
properties. Axioms 1 and 2 of the 
mathematical Appendix, plus the 
assumption that there are at least 
three possible outcomes with different 
utilities, would be more than enough.) 
Permissible transformations must not 
change equal SEUs, which imply 
indifference between A and B, to 


unequal SEUs, which imply that one 
is preferred to 
quently : 


> ¥(Aj)u(A) => V(B)u(Bjeo 
¥ W(Adw' (A) = (B;/u'(B;) [2] 


the other. Conse- 


A; = i” possible outcome of course of 


A, @ 329, ***, Fi, 


action 
B; = j™ possible outcome of course of 


1,2,--:,J. 


action B, j = 


Substituting the transformations 
under consideration into the second 
half of Equation 2 produces: 


> (ed Ai] + d) (aul A,) + 5) ]) 


= ¥ [ (coy B;] + d)(au[B;] + 6)] 


Multiplying and collecting terms: 


acL> ¥(Aju(A,) — ¥ o(By)u(B;) ) 


7 


+ b> ¥(A) — ¥ V(B))) 
+ ad[> u(A,) — ¥ u(B;) ] 


7 


+bdfI-—J]=0 [3] 


By hypothesis: 


> ¥(Aju(A,) — & v(B;)u(B;) = 0 


and so the first term of Equation 3 
vanishes. Since both 6 and d may 
assume any arbitrarily chosen values, 
the only way to guarantee that the 
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sum of the other three terms will be 
zero for any values of 6 and d is to 
require that the quantity inside the 
brackets in each term be zero. But 
the quantity inside the brackets in 
the second term (and in the third also) 
is explicitly not required to be zero. 
Consequently this set of transforma- 
tions cannot be permissible. Since 
both a and ¢ are required to be greater 
than zero, the only way to guarantee 
that Equation 3 will be correct is to 
require that both 6 and d be zero. 
That, of course, means that both 
utility and SP must be measured on 
ratio scales. Thus the second half of 
Theorem 1 is established. 

Now consider the first half of 
Theorem 1, which is concerned with 
ASEU models. In such models, the 
first and second terms of Equation 3 
are zero by hypothesis. Furthermore, 
it is clear that in such models SPs may 
not have an arbitrary origin, since 
they are required to add up to a fixed 
constant, and consequently d must be 
zero. If dis zero, the other two terms 
vanish, and no restrictions on } are 
necessary. This establishes the first 
half of Theorem 1. 


Possible Relations between Utilities and 
Objective Values 


Luce (1959) has recently pointed 
out some surprising and severe restric- 
tions on possible relationships between 
objective and subjective continua of 
any sort, in a paper of extraordinary 
depth and importance. He asserts 
two principles to which any function 
relating an independent to a depend- 
ent variable within a theory should 
conform. First: 


Admissible transformations of one or more 
of the independent variables shall lead, via the 
substantive theory, only to admissible trans- 
formations of the dependent variables. (p. 
85.) (Note that a form of this principle was 
used to prove Theorem 1.) 
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Secondly : 


except for the numerical values of param- 
eters that reflect the effect on the dependent 
variables of admissible transformations of the 
the mathematical 
the substantive theory shall be 
independent of admissible transformations of 


(p. 85.) 


independent variables, 


structure ol 
the independent variables. 


The first condition means, for ex- 
ample, that a theory in which the 
dependent variable is measured on a 
ratio scale may not require that when 
the arbitrary origin of an interval 
scale independent variable is changed, 
the origin for the dependent variable 
must change also. The second, more 
subtle, condition means that the 
mathematical form of the function 
relating independent to dependent 
variables may not be changed from, 
for example, a power function to a 
simple sum of variables by permissible 
transformations on the independent 
variables. Both of these principles 
of theory construction seem plausible 
and even necessary. What do they 
imply for utility and SP? 

First, consider interval scale utility 
as dependent variable and ratio scale 
money (or quantity of any other con- 
tinuously variable good) inde- 
pendent variable. The relation be- 
tween these two variables, if it is to 
conform to the two requirements of 
the previous paragraph, must, ac- 
cording to Luce’s theorems, have the 
form u($,) = a log $; + b, where a 
and 6 are arbitrary, constants and §$; is 
any amount of money. If, as is neces- 
sary in NASEU models, utility is 
measured on a ratio scale, the rela- 
tion between it and money must, 
according to Luce’s theorems, be 
u($;) = a$,° (again, a and Bb are arbi- 
trary constants). 

These restrictions on the possible 
forms of utility functions are astonish- 
ingly severe. For example, they im- 
ply that if for interval scale utilities 
there are any points on the function 


as 
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relating utility of money to its objec- 
tive value at which that function 
increases (that is, if @ is positive), and 
if the value ot 0 is finite, which it must 
be if the equation is to make sense, 
then the utility corresponding to zero 
dollars (or some fixed positive or 
negative amount, since the origin 
may be displaced by a fixed amount) 
is minus infinity. The utility of nega- 
tive amounts of money (or amounts 
less than the fixed amount) is unde- 
fined. And the form of the utility 
function is necessarily negatively ac- 
celerated; that is, it is impossible to 
find a person who attaches as much as 
twice as much utility to $2x as he does 
to $x. In short, interval scale utilities 
associated with ratio scale money 
values require ridiculous utility func- 
tions. 

The relation between ratio scale 
utilities and ratio scale money values 
is much less ridiculous. Its most 
important restrictive characteristic is 
that it may have only one, inflection 
point. That means that if the func- 
tion is ever positively accelerated, it 
is always positively accelerated; if it 
is ever negatively accelerated, it is 
always negatively accelerated. Even 
this characteristic is pretty restrictive. 
But it is not utterly ridiculous, 


as are the properties of interval 


scale utilities discussed in the previous 
paragraph. 

These restrictions are so surprising 
that one is naturally tempted to try 
to wiggle out of them by arguing 
against the assumptions which lead to 


them. Luce (1959) has extensively 
discussed these assumptions and their 
possible limitations. This paper will 
not cover the same ground again; 
instead, I will simply record my 
opinion that at least for interval 
scales Luce’s restrictions are inevi- 
table and we must therefore learn to 
live with them. 

Do these principles of theory con- 
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struction place any restrictions on 
possible relations between SP and 
OP? Probably not. We _ usually 
consider OP to be measured on an 
identity scale; if it is, Luce’s argu- 
ments place no special restrictions on 
the mathematical form of possible 
relationships between it and anything 
else. 


GENERAL CHARACTERISTICS OF 
PSYCHOPHYSICAL DECISION 
MOopDELS 


Before getting down to specific 
issues concernings SPs, I must set 
the stage by discussing some general 
issues and some characteristics which 
all ASEU or NASEU models must or 
should have. 


Identifying Characteristics 


To predict choices from an SEU 
maximization model of any kind, you 
must know the utility and the SP of 
each possible outcome of each avail- 
able course of action. You can then 
substitute these values into Equation 
1, perform some arithmetic, and pre- 
dict that the course of action with the 
highest SEU will be chosen. Such 
predictions will be wrong on occasion. 
This fact can be included in the model 
(at a heavy penalty in additional 
mathematical complexity) by making 
it stochastic, or it can be excluded 
from the model and left to a general 
theory of errors, as has been cus- 
tomary in most of psychophysics. 
This paper does the latter, but it 
seems likely that many of the conclu- 
sions established here for deterministic 
models will apply to stochastic models 
of the same type as well. 

In order to use an SEU model, then, 
you need to know utilities and SPs 
independently of the decision you are 
trying to predict. How can you find 
them? That is a very difficult, much 


Warp Epwarps 


debated question. For the purposes 
of this paper, it is enough to assume 
that some psychophysical method is 
available which gives accurate utility 
and SP measures for any object or 
event of interest, and that you have 
available such information about any- 
one whose decisions you are predicting. 

In what form can such information 
be available? Consider first informa- 
tion about utilities. If money is the 
valuable object in question, presum- 
ably a table could be prepared show- 
ing the amount of utility an S at- 
taches to a gain of one dollar, of 
two dollars, of three dollars, and so 
on. (My reasons for saying ‘‘a gain 
of one dollar” rather than simply the 
amount one dollar or a total fortune 
of $5001 will become clearer later; 
the differences among these formula- 
tions are irrelevant at this point.) 
But the information in that table 
would have to be based on experi- 
ments in which the S received or lost 
particular dollar bills, with particular 
serial numbers. How do we know 
that he will respond similarly to differ- 
ent dollar bills, or perhaps even to 
nickels and dimes instead? Of course 
we don’t. We simply assume that 
(within reasonable limits) the form 
in which S receives the increment of 
one dollar is irrelevant; that is, we 
abstract the property ‘‘United States 
currency with an objective value of 
one dollar’ from a variety of other 
properties of the stimuli used in the 
utility measurement experiment, and 
assume that it is the one on which the 
results most significantly depend. 

In the case of utility, the point 
made above is so familiar as to be 
trivial. But what about the case of 
SP? On what properties of events do 
SPs most significantly depend? The 
most obvious characteristic of events 
which should turn out to be highly 
correlated with SP is, of course, OP 
(remember that this paper is re- 
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stricted to classes of events for. which 
OPs are well and independently de- 
fined). Unfortunately, unpublished 
data make it perfectly clear that if 
two bets are identical in payoff, OP, 
and all other characteristics except the 
means by which the OP is displayed, 
S may markedly prefer one to the 
other—so much so that he is willing to 
pay for the privilege of using his pre- 
ferred display, even though he loses 
in expected value (hereafter abbre- 
viated EV) by doing so. Within the 
framework of the theories discussed 
in this paper, this finding can be ex- 
plained only by assuming that several 
different SPs may correspond to a 
given OP, depending on the nature of 
the OP display. In short, any at- 
tempt to state SP as a function of OP 
alone is doomed to failure. 

What then is SP a_ function 
of? Mathematical decision theorists, 
aware of logical difficulties which 
result from thinking of SP as a func- 


tion of OP, often use a quite different 


approach. They think of SP as a 
characteristic of interaction between 
a specific event and a specific person. 
This, of course, means that in order 
to make a prediction they must inde- 
pendently determine the SP asso- 
ciated with the particular event about 
which the prediction is made for the 
person whose behavior is to be pre- 
dicted. Such a strategy for theorizing 
has two major difficulties. The first 
is that it makes little attempt at 
generality. Obviously some charac- 
teristics of an event are important in 
determining its SP and some are not; 
we need to know which are which. 
Furthermore, if SPs are to be used to 
predict real-world decision making 
behavior (e.g., for designing displays 
in military information processing 
systems), some set of rules which 
govern the relationship between event 
and resulting SP must be found so 
that predictions can be made without 
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running experiments on each situa- 
tion and person to which the predic- 
tions are intended to apply. 

What we need and do not have is 
some systematic empirical informa- 
tion about various displays of proba- 
bilities. In the absence of identified 
stimulus dimensions relevant to SP 
other than OP, decision theorists are 
in as awkward a position as psychol- 
ogists interested in loudness would be 
in if nothing were known about the 
physics of sound. Of course this 
position can only be made worse by 
ignoring the one stimulus variable 
which we know to be highly corre- 
lated with SP and therefore im- 
portant: OP. But OP is not enough. 
Decision theorists cannot much longer 
get along without empirical informa- 
tion about the psychophysics of proba- 
bility displays. 

Since we have no useful information 
about displays of probabilities, this 
paper can only assume that they are 
important but leave their effects un- 
specified. The following discussion 
assumes that each event to which a 
decision theory might be applied has 
certain as yet unspecified character- 
istics which determine its SP. One 
of those characteristics is, of course, 
OP, whenever OP is defined. The 
others are unknown; this paper will 
call them identifying characteristics. 
Only two assumptions about identify- 
ing characteristics will be consistently 
made. The first is that whenever an 
event is repeated as exactly as pos- 
sible, its identifying characteristics 
are unchanged ; in other words, identi- 
fying characteristics are recurrent, not 
unique. (The contrary possibility 
would make them inaccessible to 
scientific study.) The second assump- 
tion is that it is quite possible to 
have events with different identifying 
characteristics but the"same OPs; this 
simply means that OP is not perfectly 
correlated with identifying charac- 
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teristics. (If it were, then Theorem 3 
below would imply that SP always 
equals OP, and so the point of the 
notion of identifying characteristics 
would be lost.) 


The SP Function Book 


Mathematically, this paper will 
conceive of SP as a function of OP 
and of identifying characteristics. 
One way to think about this function 
is to imagine a book. Each page of 
this book has on it a page number (the 
quantity r of the Appendix) and a 
function which relates SP to OP. 
These functions need not necessarily 
be continuous. The page number is 
simply a code which refers to a par- 
ticular set of identifying character- 
istics. Thus, given any event, you can 
find its SP by finding out its identi- 
fying characteristics, looking up the 
number corresponding to that par- 
ticular set of identifying character- 
istics, finding the page in the book 
which has that page number, and then 
looking up the SP corresponding to 
the OP of the event in question on 
that page. Mathematicians may find 
this concept and the notation of iden- 
tifying characteristics clearer if they 
examine the axiom system and proofs 
of Theorems 2 and 3 in the Appendix. 
The assumptions of that proof for- 
mally define ASEU models of this 
class. It turns out that the formula- 
tion in terms of a function book, even 
if that book may have an infinite 
number of pages, implies that SP 
always equals OP if additivity of SPs 
is assumed. But the notion of an SP 
function book is appropriate for 
NASEU models, for which no such 
implication exists. 

The idea of the previous paragraph 
should be familiar to psychophysicists. 
This model is analogous to a familiar 
model in auditory psychophysics. 
Perceived loudness of pure tones is a 
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function of both stimulus intensity 
and stimulus frequency. A nuniber of 
different functions relating loudness 
to intensity, each using a different 
value of frequency as a parameter, 
are all an auditory psychophysicist 
would need to make predictions about 
loudness. (He would actually use 
equal loudness contours for the pur- 
pose, but the underlying model is the 
same. ) 

The purpose of this 
provide an orderly way to deal with 
the possibility that more than one SP 
may correspond to a given OP, de- 
pending on display characteristics. 
Of course, I hope that such a book 
of SP functions would have relatively 
few pages, and an orderly structure, 
so that the sequence of pages is not 
arbitrary. But neither of these char- 
acteristics need be the 
following discussion. 

Incidentally, uncertain events as 
well as risky ones can be provided for 
within the notion of a_ book of 
SP functions. Presumably uncertain 
events will have identifying charac- 
teristics different from risky events 
(indeed having no definable OP is 
presumably itself one identifying char- 
acteristic), and so pages appropriate 
for such events can be segregated in 
one section of the book. Each such 
page has, as its function relating SP 
to OP, a flat horizontal line running 
from OP = 0 to OP = 1. (More 
precisely, this line is defined only for 
0 < OP <1.) Thus once its identi- 
fying characteristics are known, the 
SP of an uncertain event is completely 


notion is to 


assumed in 


specified. 


ASEU MopeE-Ls 


In spite of major differences in the 
axiom systems which lead to them 
and in the philosophy which motivates 
them, for the purposes of this paper 
most of the ASEU models are very 
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similar mathematically and opera- 
tionally to expected utility maximiza- 
tion models (hereafter called EU 
models) like the one proposed by von 
Neumann and Morgenstern (1947). 
The only difference, and it is a crucial 
one, is the use of SPs instead of OPs 
in the models. 


Finite vs. Infinite Event Sets 


Two main classes of possible ASEU 
models are defined by the dichotomy 
between finite and infinite event sets. 
Mathematically, the distinction is 
simple enough. If we consider all 
possible outcomes of 10 tosses of a 
coin, then this is a finite event set. 
If we throw a dart at a target, then 
all possible places it can hit form an 
infinite event set (note that the point 
of the dart need not be a mathematical 
point with zero area). Obviously a 
person can encounter only a finite set 
of events in his lifetime, but that is 
irrelevant. The point is that he will 
encounter many situations in which 
some one of an infinite set of possible 
may At first glance, 
it would seem desirable, then, to have 


events occur. 
a decision model capable of coping 
with any member of an infinite event 
set—and most decision models are 
just that. A few theorists (notably 
Davidson, Suppes, & Siegel, 1957) 
have preferred to think about finite 
event mostly because they 
wanted to be able to check all of 
their predictions, instead of some 
finite subset of an infinite set of 
predictions. As will become clear 
later, finite event immensely 
complicate the mathematics, and at 
the same time reduce the value of the 
model by making it inapplicable to 
situations in which the set of possible 
events is infinite. Although this 
paper discusses finite models below, | 
consider such models far less interest- 
ing than the infinite models, 


sets, 


sets 
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Please remember that the notion 
of an infinite event set is not the same 
thing as the notion of a continuum of 
OPs. If all elements of a continuum 
of OPs are realizable by members of 
an event set, then that set is infinite 
(nondenumerably infinite, to be pre- 
cise), but an infinite set of events, all 
with different OPs, may still have the 
property that between any two OPs 
which are associated with members of 
the event set lies an OP which is not. 
In fact, continuity of OPs is not 
assumed directly in Theorems 2 and 
3 below, though it can be deduced as a 
theorem from the nonatomicity prop- 
erty which is assumed. 


Restricted vs. 
Functions 


Unrestricted Sets of SP 


How many different SPs may be 
associated with different events all 
having the same OP? Or, to ask 


essentially the same question in a 


different form, how many different 
functions may relate SP to OP? Still 
another formulation: how many pages 
may there be in the SP function book ? 
Three classes of answers to this ques- 
tion exist for infinite event sets, and 
two for finite ones. For infinite event 
the number of different SPs 
which may be associated with differ- 
ent events all having the same OP 
may be finite or infinite. If that 
number is infinite, it may be de- 
numerable (capable of being set in 
one-to-one correspondence with the 
integers) or nondenumerable (the set 
of all points on a line segment is 
nondenumerable; so is the set of all 
OPs). It turns out that the finite and 
denumerably infinite cases have the 
same properties, so only the latter, 
less restrictive case will be considered. 

The formulation of the relation be- 
tween OP and SP in terms of the SP 
function book tacitly presupposes 
that the set of different SPs which 


sets, 
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may be associated with different 
events all having the same OP is at 
most denumerably infinite. If, as 
the preceding discussion has _ sug- 
gested, the difference between two 
different events with the same OP 
but different SPs lies in the way in 
which that OP is displayed, then the 
assumption of denumerability is sim- 
ply the assumption that all possible 
ways in which an OP could be dis- 
played could be listed (in infinite 
time). It is difficult to see why any- 
one would resist so mild an assump- 
tion—and yet Theorem 3 will show 
that its consequences are drastic. 
Nevertheless, it seems greedy for a 
theorist to insist that a denumerably 
infinite set of different SPs to asso- 
ciate with a given OP is not enough 
and that he wants more. I prefer to 
accept denumerability and abide by 
its rather drastic consequences. 

For mathematical convenience, 
however, it is expedient first to ex- 


amine a restriction wich applies even 


to the nondenumerable case. It is 
well known that an infinite number 
of different probability measures can 
be defined on a given event set. This 
makes it seem unlikely that any sub- 
stantial formal restrictions exist on 
models of the nondenumerable ‘‘any- 
thing can happen”’ sort. But there 
is at least one. Usually SPs rise with 
OPs. In fact, it is tempting to assume 
that they always do so. Consider two 
events, A and B. If the OP of A is 
higher than the OP of B and the SP 
of A is less than the SP of B, call that 
pair of events an inversion. The 
assumption that SPs always rise with 
OPs is an assumption that inversions 
cannot occur. But unless inversions 
exist in infinite numbers and at all 
values of OP, then it is easy to prove 
that SP always equals OP. Theorem 
2 below states a stronger assertion 
from which this one is a self-evident 


( onseq uer c. 
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Theorem 2: Assume that an infinite, 
nonatomic event set is being consid- 
ered, that every event has an SP and 
an OP, that the SPs of any EEE must 
add to one, and that some event can 
be found for which SP # OP. It 
follows that for any OP p, 0 < p < 1, 
two events A and B can be found such 
that p(A) = pand p(B) < p(A) and 
¥(B) > ¥(A) + Q where Q is a non- 
infinitesimal quantity (unless the 
maximum difference between SP and 
OP is infinitesimal, or p(A) or p(B) 
is infinitesimally different from Oor 1).? 

A more precise mathematical formu- 
lation of Theorem 2, and its proof, are 
long, technical, and contribute little 
to the following discussion. I have 
therefore put them in a mathematical 
appendix which follows the list of 
references. The only important as- 
sumption necessary to prove Theorem 
2 which has not already been dis- 
cussed is that of nonatomicity; a 
discussion of it follows Theorem 3, 
for which it is also essential. Its 
content is primarily technical. The 
assumption that all events in the 
event space have both OPs and SPs 
has been discussed. The assumption 
that some event exists for which 
SP # OP is the only reason for even 
thinking about SPs in risky situations; 
without it there is no point in a theory 
about SPs. 

Lemma 4 of the mathematical ap- 
pendix implies some further state- 
ments about subjective probabilities 
in this nondenumerable case. If SP 
is ever different from OP, there must 
be some event or events for which 
the size of that difference (disregard- 
ing sign) is a maximum, or at any 
rate is only infinitesimally different 
from a maximum. In fact, because 
of the additivity of SPs there must 
be at least two such events, one in 


2 This theorem and its proof were formu- 


lated by W. M. Kincaid. 
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which the difference is positive and 
one in which it is negative. Call the 
event with the smallest OP for which 
this maximal difference is present and 
positive M. Event M has an OP 
p(M) and an SP y(M). For any 
Event A, define d(A) = ¥(A) — p(A). 
Then d(M) is the largest positive 
value of d(A) that can occur, and 
d(M) is the largest negative value of 
1(A) that can occur. Now what can 
be said about the difference between 
SP and OP for events other than M? 
For any such Event A, max d(A) 
= d(M), and min d(A) = d(M). 
Lemma 4 shows that for every proba- 
bility » there must be an Event B 
having OP p such that: 


racayy| 2 


d(B) (M) 


p(B) = p(M) 


d(B) = [d( uj] 


p(B) 
p(M) 
p(B) s p(M) 


There must also be an Event C having 
OP p such that: 


d(C) = [d( mi] q(C) 


b(M) 


= q(M) 


4} D(C) 
a 4D] 


S q(M) 


M 


p(C) 
Finally, it shows that events must 
exist having that same OP p and all 
possible values of y between ¥(B) and 
¥(C). In other words, in a space 
relating SP to OP there is a parallelo- 
gram of points defined by the four 


vertices (0, 0) 1), Lp(M), ¥(M)], 
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and [p(M), ¥(M)]. All points within 
this parallelogram must be realizable 
by events in the event space with 
appropriate subjective probabilities. 
The preceding paragraph leads up 
to Theorem 3. The awkward property 
exhibited in Theorem 2 applies to 
models which permit a nondenumer- 
ably infinite set of SPs to be asso- 
ciated with any given OP. Next 
consider what happens if any OP can 
be found (other than 0 or 1) with 
which no more than a denumerably 
infinite set of SPs is associated. 
Theorem 3: As in Theorem 2, as- 
sume an infinite, nonatomic event set, 
each event having an SP and an OP, 
and assume that the SPs of any EEE 
must add to one. In addition, assume 
that there is some OP other than 0 
or 1 which has the characteristic 
that the number of different SPs 
which occur in conjunction with 
different events all having that OP is 
at most denumerably infinite. It 
follows that SP always equals OP.* 
Again a more precise formulation 
of the theorem, and its proof, are in 
the mathematical Appendix. Certain 
additional assumptions of purely tech- 
nical importance (e.g., continuity at 
zero) are necessary to the proof, but 
all the assumptions which I consider 
at all controversial are stated here. 
At this point it is worth while con- 
sidering the assumption of non- 
atomicity which underlies Theorems 
2 and 3. Suppose I assert ‘For any 
event which is already included in 
an event set, I wish to include another 
which is defined as the simultaneous 
occurrence of the event in question 
and heads in the toss of a coin which I 
happen to have handy.”’ This asser- 
tion defines a nonatomic event set 
(in fact, it is somewhat stronger than 


* This theorem and its proof were formu- 
lated by L. J. Savage. W. M. Kincaid turned 
Savage’s sketched proof into the form con- 
tained in the Appendix, 
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is necessary to specify a nonatomic 
event set). In other words, the 
property of nonatomicity simply as- 
serts that any given subset may be 
further subdivided. Unless the set of 


events with which you intend to work 
is rigidly defined in advance, it is 
impossible, as | see it, to think of any 


event set which is not 
nonatomic. However, this property 
is far from trivial. For example, the 
assumption of nonatomicity directly 
implies that every OP is realizable 
by some event in the event set, and 
that the event set is nondenumerably 
infinite. 

All in all, the characteristics of 
ASEU models based on infinite classes 
of events seem unattractive. We may 
adopt any of three strategies. One 
is to accept and live with these un- 
fortunate properties. That 
bility has already been examined. It 
is now time to mention very briefly 
the second set of possibilities. 


real world 


possi- 


Finite Event Sets 


The arguments given 
against finite events 
think, strong enough to argue against 
their serious consideration. How- 
ever, it is interesting to speculate 
further about their mathematical 
properties. It seems quite likely to 
me that analogues of Theorems 2 and 
3 must exist for finite cases. Such 
theorems would define maximum dif- 
ferences between SP and OP as a 
function of number of events in the 
finite event set, spacing of the OPs of 
those events on the OP continuum, 
number of different SP functions 
permitted (if that number is less than 
the number of events in the event 
set), and number and location of 
inversions of ordering between SPs 
and OPs. It seems plausible both 
that such theorems exist and that 
Theorems 2 and 3 are limiting cases 


previously 


sets were, | 
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as the number of events in the event 
set approaches infinity. Unfortu- 
nately, I have been unable even to 
guess what the form of such theorems 
might be (except for specially con- 
structed cases, such as the case of 
equal spacing of events along the OP 
continuum). As is often the case, 
it looks as if the finite cases 
mathematically much more difficult 
than the infinite cases. Even in the 
case where only a single monotonic 
function relating SP to OP is per- 
mitted (surely the simplest finite 
case), I have been unable to formulate 
a specific theorem relating number 
and spacing of events in the event set 
to maximum possible difference be- 
tween OP and SP. 


are 


The Third Strategy 


This paper has 
classes of ASEU 
infinite event sets and a class based 
on finite event sets; all are unattrac- 
tive for various reasons. The third 
possibility is to abandon ASEU models 
altogether and use NASEU models 
instead. Some previously discussed 
considerations about utility functions 
make this option attractive. But the 
NASEU models are so much 
powerful than the ASEU models that 
I would be very reluctant to turn to 
them on a priori grounds alone. So 
before examining them seriously, it is 
worth while to review the empirical 
evidence which bears on them. 


considered two 


models based on 


less 


THE PROBABILITY PREFERENCE DATA 


A rather substantial body of data 
bears on the relation between SP and 
OP. These data come from a series 
of experiments (Edwards, 1953, 1954a, 
1954b, 1954d, 1955) which I have 
collectively called the probability 
preference experiments. 

The basic design of the probability 
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preference experiments was as follows: 
Lists of bets were prepared. Each 
list included eight bets, with OPs 
of winning (or losing, depending on 
the list) running from 1/8 through 
8/8. In various experiments a total 
of eight different were used. 
The eight bets in each list all had the 
same objective EV, so that there was 
no objective reason to prefer one bet 
in a given list to any of the others in 
the list. All 


were paired with one another accord- 


lists 


same bets in each list 
ing to the method of paired compari- 
At total 
well over 100 college students were 


sons. various times, a of 
required to choose between the mem- 
bers of these pairs. 
the Ss in 
looked at slides of the pairs of bets, 


In some experi- 


ments, sat classrooms, 
“as if they were 


Other Ss, run individu- 


and made choices 
gambling.” 
ally, first imaginary 


then gambled for worthless chips, and 


made choices, 
finally for real money. 

The main results of these experi- 
ments, taken as a whole, were: 

1. Although there were substantial 
individual differences in choices, cer- 
tain patterns of choices showed up in 
all experimental groups and in just 
about all individuals. The two most 
outstanding of these patterns were 
that Ss usually preferred the bet 
with the 4/8 probability of winning 
from any positive EV list to the other 
bets on that list, and usually preferred 
the bet with the lower probability and 
higher amount of loss in any pair of 
negative EV bets. 

2. The preferences cited above, and 
indeed the complete pattern of choices, 
were relatively independent of EV 
levei of the list of bets involved, so 
long as the zero point was not crossed. 
This means that the preferences ob- 
attributed to the 
OOPS involved, which were constant, 


ecrved should be 
ier than to the amounts of money, 
hich varied from list to list. This 
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fact suggests that for these students 
and these amounts of money, the 
utility curve is relatively linear with 
amounts of money, while the SP 
curve is not linear with OP. It also 
suggests that the value of OP from 
which SP deviates most widely is 0.5, 
for positive EV bets. Using the 
assumption that SP = OP and the 
further assumption that the size of 
the just noticeable difference for 
utility is half a cent, it is impossible 
to construct a utility function to 
account for the preferences observed. 
If the size of the just noticeable differ- 
ence for utility is assumed to be zero, 
of course such a curve can be con- 
structed, but it has at least 12 inflec- 
points between $0 and +$6. 
The same sorts of statements can be 
made if the data are analyzed S by S. 


tion 


3. The complete pattern of choices 
changed radically from positive EV 
to negative EV bets, even though 
exactly the OPs were used. 
The main change was a strong prefer- 
ence for negative EV bets in which 
the probability of losing was low and 
the amount of possible loss high. 
When this preference was removed 
from the data by crude statistical 
means, the residual preference pat- 
tern pretty much the mirror 
image of the preference pattern for 
EV These findings 
suggest that there is a strong inter- 
action between utility and SP. 


same 


was 


positive bets. 


4. The differences among just im- 
agining, gambling for worthless chips, 
and gambling for real money are dis- 
cernible, but not sufficiently serious 


to invalidate any of the conclusions 
This 


the difficulties which might arise in a 


discussed above. means that 
gambling experiment because S wins 
or loses on each bet before he decides 
the bet much 


severe than a priori considerations 


about next are less 


might lead one to expect. 
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5. Minor changes in wording, in 
the choice of payoff events and in the 
financial status of Ss do not make any 
detectable differences in their choices. 
In particular, Ss can be made to win 
exceedingly large amounts of money, 
or made to lose substantial but smaller 
amounts of money, 
cantly 
choices. 


without signifi- 
their patterns of 
This finding has been con- 
firmed in a study devoted especially 
to it (Edwards, 1954d). 

Findings 1 and 2 cause me to 
reject the hypothesis that SP = OP. 
Therefore, they, in conjunction with 
the discussed in the 
previous section, cause me to reject 
all EU and ASEU models. The im- 
plications of Findings 3, 4, and 5 for 
a NASEU model are important; they 
will be discussed below. 


changing 


considerations 


NASEU MopbeE.s 


What are the mathematical proper- 
ties, if any, of a NASEU 
Take a look at Equation 1. It defines 
NASEU as well as ASEU models. 
But what does the operation of addi- 
tion called for by the sigma in Equa- 
tion 1 mean for a NASEU model? 
For an ASEU model, that sigma sim- 
ply derives from the usual definition 
of a probability measure, and reflects 
the additivity property of that meas- 
ure. For a NASEU model, it has no 
such simple mathematical justifica- 
tion. A variety of psychological 
arguments about its plausibility are 
available. This paper will not press 
them; evidence to be presented later 
argues against this form of NASEU 
model, 
defend such a model. 


model ? 


advocate or 
The purpose of 
this section is to introduce some ideas 
which are a part of the model which 
I dodefend. For the time being, there 


and I do not 


fore, it is enough to say that no law of 


menfor mathematics forbids the multi- 
plication of utility and nonadditive 


SP values, the algebraic addition of 
the products, or test of the hypothesis 
that people make decisions in such a 
way that they in fact maximize the 
resulting sums. 


Ratio Scale Utilities 


However, Theorem 1 indicates that 
a NASEU model must use a ratio 
scale measure of utility. A _ ratio 
scale of utility implies that utility 
has a true zero point. Where is it? 
Only one answer is plausible: where 
you now are. Zero utility is your 
current position, and you can never 
leave it. 

This is not exactly a new idea. 
The first implicit use of it was by 
Mosteller and Nogee (1951), who 
used the EU model to make a utility 
scale for money (and, incidentally, 
proposed nonadditive SPs as an 
alternative explanation for their re- 
sults). Mosteller and Nogee used a 
gambling situation in which real 
money changed hands during the 
course of the experiment. But in 
determining the utility of a given 
amount of money, they simply found 
an indifference point which involved 
that amount of money without mov- 
ing the origin to take into account S’s 
financial ‘status at the time they 
measured it. Such a procedure can 
yield a classical utility function only 
if the form of that function is in- 
variant up to a linear transforma- 
tion under movements of the origin 
(through which all utility functions 
conventionally pass) along the func- 
tion. Only a very limited class of 
functions, of which the most familiar 
member is the straight line, has this 
property, and the utility curves 
Mosteller and Nogee obtained don’t 
look like any member of that class. 
Consequently those curves in fact 
defined zero utility as the present 
monetary status of each S, and so 
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were of the type that this section 
advocates. 

The first self-conscious use of an 
idea like that proposed here was 
made by Markowitz (1952). He 
suggested that the zero point of the 
utility scale be taken as S’s customary 
financial position, and that the form 
of the function changes when that 
customary financial position changes. 
He used this idea to remedy a defi- 
ciency in Friedman and Savage's 
(1948, 1952) previous account of 
gambling and insurance buying. The 
only difference between Markowitz’s 
position and this one is that Marko- 
witz defines zero utility as the custom- 
ary position, while this paper defines 
it as the current position. 

Although this conception of utility 
is novel and quite different from the 
traditional one, it is neither internally 
contradictory nor absurd. Nor does 
it imply that people cannot change 
their habits as they change their 
financial status. Such changes in 
habits simply show up as changes in 
the shape of the utility curve. Fur- 
thermore, this point of view is not 
empirically contentless. If it is to be 
useful, people must not make serious 
changes in their subjective value 
scale for a commodity as a result of 
relatively small changes in the amount 
of that commodity which they possess. 
This proposition is in principle test- 
able, though testing it experimentally 
is difficult. However, Findings 4 and 
5 of the probability preference experi- 
ments (see above) suggest that it 
may be correct. 


Indifference Curves and a Property of 
NASEU Models 


Loss of the addition theorem for 
SPs, even though compensated by a 
true zero for utility, is a very serious 
loss. Can a model of the type de- 
fined by Equation 1 mean much with- 
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out an addition theorem for SPs? 
Yes. Equation 1 implies a strong 
decomposability property which has 
considerable significance both mathe- 
matically and empirically. The fol- 
lowing discussion exhibits this prop- 
erty and derives two theorems, one 
of which can be empirically tested. 
These theorems have the additional 
advantage of being true for all 
plausible models defined by equations 
like Equation 1. This means that an 
empirical verification of them would 
be strong evidence for the usefulness 
of the class of maximization models 
with which this paper is concerned, 
while an empirical disproof would 
necessitate search for a quite different 
kind of theory of decision making (e.g., 
one based on variance preferences). 
The following discussion deals ex- 
clusively with bets of the following 
form: Event E has probability p of 
occurring. If it occurs, S receives an 
amount of money x, which is greater 
than 0. If it does not occur, no 
money changes hands. Any such bet 
can be described as a point (x, p); 
that is, all such bets can be com- 
pletely represented by a plane bounded 
at p=0, p=1, and x =0. Now, 
consider the bets (x, p) and [ (x + e)p ] 
where eis a small positive number. 
A great deal of evidence, plus common 
sense, suggests that most Ss would 
prefer the second of these two bets 
to the first. Similarly, [x, (p + f)] is 
preferable to (x, p) where f is another 
small positive number. If both of 
these assertions continue to be true 
as e and f approach zero for any 
values other than zero of x and p 
(a reasonable idealization), and if the 
direction of preference reverses for 
any negative values of e and f (an- 
other reasonable idealization), it fol- 
lows that a set of indifference curves 
can be drawn in the xp plane. An 
indifference curve in this application 
is simply a function which defines a 
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Fic. 1. Hypothetical indifference 
among simple bets which have ‘‘no money 


changes hands” as one of the two possible 


map 


outcomes. The rectangles are described in 


Theorems 4 and 5 of the text. 


set of bets among all of which S is 
indifferent (see Coombs & Beardslee, 
1954: Edwards, 1954c for details). 
A set of indifference curves is called 
an indifference map; Figure 1 shows 
such a map. ‘The properties to be 
discussed below are general properties 
of a certain family of indifference 
curves and have applications to the 
use of indifference curves in economic 
theory; these applications will be 
discussed at the end of this section. 

What does the SEU model have to 
say about such indifference curves? 
Since each curve identifies bets among 
all of which S is indifferent, all bets 
on a given curve must have the same 
SEU. In utility and SP 
measures the equation of each in- 
difference curve c must be: 


short, in 


Yu. = k- | 4 | 
Of course there is a different value of 
k for each curve. 

But indifference curves are stated in 


What 


physical, not subjective, units. 
implications does Equation 4, and 
therefore SEU maximization models, 


have about the form of indifference 
curves stated in physical units? It 
is obvious that if any single indiffer- 
ence curve is selected, it is possible 
to find a transformation on the prob- 
ability and money axes which will 
make it conform to Equation 4, which 
is of course the equation of a rectangu- 
lar hyperbola. In fact, any set of two 
indifference curves could be made to 
conform by proper choice of trans- 
formations. But that uses up all the 
arbitrariness implied in Equation 1. 
To put it another way, the trans- 
formations necessary to make these 
two curves conform to Equation 4 
completely specify the utility and 
subjective probability functions for 
this S. Now, if Equation 4 is to be 
correct, it must turn out that all 
other indifference curves in the xp 
plane must be turned into rectangular 
hyperbolas by the same utility and 
SP functions which turned the first 
two into rectangular hyperbolas. This 
is an exceedingly strong condition on 
the family of indifference curves. 
The following proof shows that an 
indifference map satisfies this require- 
ment if and only if a quite specific 
relationship exists among the slopes 
of the indifference functions passing 
through any four points at the corners 
of a rectangle whose sides are parallel 
to the coordinate axes. 

The theorem assumes that an in- 
difference map is given each of whose 
functions is continuous, monotonic 
decreasing, and has continuous deriva- 
tives. The utility and SP trans- 
formations are assumed strictly mono- 
tonic increasing. They are also 
assumed single-valued, which means 
that the events displaying the OPs 
used must all have the same identify- 
ing characteristics if Theorems 4 and 5 
are to be applicable. Two values of 
money, x; and x», and two values of 
probability, p; and ps, are chosen 
at random (or systematically; it 
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doesn’t matter how they are chosen 
so long as both amounts of money 
are greater than zero and both proba- 
bilities lie between and one); 
they of course jointly define four bets. 
Let us call these four bets A, B, C, 
and D, as in Figure 1. From now on, 
S4 means the slope of the indifference 
curve which passes through Bet A. 
Theorem 4: If any kind of SEU 
model, additive or otherwise, is cor- 
rect, then SaSc = SgSp, and if this 
condition on the slopes of an indiffer- 


zero 


ence map is satisfied, and if at least 
one indifference function has a nega- 
tive slope, then it follows that for 
these data a NASEU model or some 
more powerful SEU model is correct.‘ 

The proof of Theorem 4 is in the 
mathematical appendix. 

The of all SEU maxi- 
mization models, including NASEU 
established — in 


property 
models, which is 
Theorem 4 is theoretically elegant 
and in principle empirically testable. 
No such empirical test is likely to 
take place, however; differentiation 
of curves requires 
precision than is usually obtainable 
in experiments Ss. 
Fortunately, the microscopic property 
described in Theorem 4 has a macro- 


empirical more 


using human 


scopic equivalent, which can be tested 
experimentally. 

Look at Figure 1 again. Choose an 
of money x‘), significantly 
smaller than x; Extend a vertical 
line from x’; to the indifference curve 
which through D; call the 
intersection point J7. From J/ extend 


amount 


passes 
a vertical line until it intersects the 
indifference curve which passes through 
A, and a horizontal line until it inter- 
sects the indifference curve which passes 
through C; call the two points thus 
identified E and G. A horizontal line 
can be extended from E and a vertical 


‘This theorem and its proof were formu- 
ated by George Minty. 
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one from G, intersecting in a point F 
thus completing another rec- 
tangle. Will Point F lie on the indif- 
ference curve which passes through 
Point B? It turns out that it will if 
and only if the indifference map has 
the property expressed in Theorem 4. 
It is probably intuitively apparent 
that the property of Theorem 4 can 
be derived from this one; Theorem 4 
is simply the limiting case as the 
difference between x; and x’; becomes 
This property can be 


and 


infinitesimal. 
expressed formally as 

Theorem 5: If any kind of SEU 
model, additive or otherwise, is cor- 
rect, then if a second rectangle is 
constructed as in Figure 1 and three 
of its vertices are chosen to lie on the 
same indifference curves as three of 
the vertices of the original rectangle, 
the fourth vertex of the new 
rectangle must lie on the same in- 
difference curve as the fourth vertex 
of the original rectangle. Further- 
more, if the vertices of the new rec- 
tangle are connected by straight lines 
to the vertices of the old rectangle, 
the slopes of the connecting lines 
will be related by the equation 
SaeSce = SprSpu, where Sag sym- 
bolizes the slope of the line connecting 
Point A with Point £ in Figure 1. 
If these conditions are satisfied, and 
if at least one indifference function 
has a negative slope, then it follows 
that for these data a NASEU model 
or some more powerful SEU model is 
correct.® 

To prove Theorem 5, start by look- 
ing again at Equation 4. With 
notation taken from Figure 1, it says 
that: 


then 


| 


5] 
[6] 
[7] 


was 


g(x’s)h( p's) = g(xi)h(ps) 


g(x’s)h(p’s) 


g(xi)h(p2) 
g(x’o)h(p'1) = g(x2)h(pi) 


Minty’s theorem 
Savage, 


» This extension of 
formulated by L. J. 
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where g(x) is the utility of x, h(p1) 
is the SP of Pr, and so on. Now 
multiply the left half of Equation 6 
by the left half of Equation 7 and 
divide by the left half of Equation 5, 
and similarly for the right halves of 
these equations. The results are: 


r 


g(x’2)h (p's) g(x2)h (ps) 8 ] 
Equation 8 is what the first half of 
Theorem 5 asserts. 

Now note that if a line is drawn 
from A to E and similarly for the 
other three pairs of vertices, each of 
these lines is a chord of an indifference 
curve. The slope of any of these lines 
is given by: 


where: 


A little 
SAESCG 
second 
This macroscopic 
analogue of Theorem 4. In fact, 
Theorem 4 follows immediately from 


will show that 
SprSpu, Which is what the 
half of Theorem 5 


result is a 


algebra 


asserts. 


close 


it if x’; is allowed to approach x, for 
then the four chord slopes approach 
the slopes of the indifference curves at 
the vertices of the original rectangle. 
This proof of Theorem 5 is only a 
proof of necessity; a proof of sufh- 
ciency very similar to that given in 
the Appendix for Theorem 4 exists, 
but will not be given here. 

The NASEU (and 


other) models expressed in Theorem 5 


property of 


can be tested by trying to perform the 
construction specified in the first 
half of Theorem 5. The relationship 
among slopes specified in the second 


half of Theorem 5 is tautologously 
true for all pairs of parallelograms 
with sides parallel to the coordinate 
axes, and so provides no additional 
information concerning the NASEU 
model. 


An experiment designed to 
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test this property, and so to test this 
class of models, is now under way. 
Although the foregoing discussion 
was expressed in terms of bets, it is 
actually a quite general property of 
certain kinds of indifference curves. 
The properties of Theorems 4 and 
5 clearly apply to any indifference 
map whose curves have the form 
f(x)g(y) = k. But they also apply to 
any indifference map whose curves 
have the form f(x) + g(y) =k. (To 
prove this, simply replace the multi- 
plications in Equations 5-7 by addi- 
tions and the divisions by subtrac- 
tions. The proof now proceeds as 
before, using additions instead of 
multiplications and subtractions in- 
stead of divisions up to but not in- 
cluding Equation 9. A similar modi- 
fication of the proof of Theorem 4 is 
possible.) This additive form of the 
indifference curves corresponds to an 
indifference map concerned with two 
goods which are independent, rather 
than competing (like butter and 
margarine) or completing (like right 
and left shoes). Thus Theorems 4 
and 5 provide an operational test of 
the hypothesis that two goods are 
independent of each other for a given 
indifference map know, 
the only operational test so far sug- 


so far as | 


gested which can be applied directly 
to an indifference map. 
this test is applicable only if 
existence of at least 
utilities is assumed. 


Of course 
the 


interval scale 


The Weighted SEU Model 


Most of the argument of this paper 
has been devoted to the proposition 
that for any of the class of models 
considered here except NASEU models 
any function relating SP to OP may 
only be The probability 
preference data indicate that it can- 
not be linear. Or do they? 

An old familiar finding in psycho- 


linear. 





SupjectivE PRopaBiLiries INFERRED FROM DecIsIONS 


physics is that the form of any sub- 
jective scale depends on the methods 
used to determine it. The same prop- 
osition may be true for SP and utility 
scaling. The probability preference 
experiments were a rather indirect 
method inferring utility and SP 
properties from gambling choices. 
What results would more direct 
methods yield ? 

In the case of utility of money, it 
seems unlikely that direct methods of 
psychophysical scaling will be useful. 
The kind of question which would 
have to be asked would sound like 
“What amount of money is twice as 
valuable as two dollars to you?” It 
seems unlikely that any S would give 
an answer systematically different 
from ‘‘Four dollars,’’ no matter what 
his utility function for money might 
look like. The difficulty is that 
money has an invincibly numerical 
character, and most people would 
probably respond to mathematical 
properties of these numbers rather 
than to any subjective values they 
might have. In any case, the evidence 
is continuing to accumulate that for 
small amounts of money utility is 
linearly related to dollar value—in 
which case psychophysical determina- 
tion of utility scales is not very likely 
to be interesting. Of course psycho- 
physical methods could be used to 
scale the utility of other objects 
which have a less numerical character, 
but it isn’t obvious how such experi- 
ments would shed much light on 
general properties of utility functions, 
because in such experiments the con- 
cept of objective value is difficult to 
define. 

In the case of SP, on the other hand, 
some psychophysical data exist. Shu- 
ford (1959) performed an experiment 
in which he showed Ss square matrices 
made of 400 thumbtacks, some red 
and some green. The Ss were re- 
quired to estimate the relative fre- 


quency of each kind of thumbtack 
in the display after a brief look, and 
succeeded in estimating with remark- 
ably good accuracy. Relative fre- 
quency in a matrix isn’t exactly the 
same thing as OP, but is very closely 
related. Shuford’s experiment sug- 
gested one in which I used these 
square matrices as the basis for 
probabilities in the probability prefer- 
ence experiment situation. I dis- 
played two such matrices and asso- 
ciated a bet with each. The Ss were 
required to estimate the relative 
frequency of each color in each of the 
two matrices, and then were required 
to choose one of the two bets. A 
random device (two 20-sided dice) 
one location on the matrix 
associated with the chosen bet, and 
the color of the thumbtack at that 
location determined the outcome of 
the bet. Preliminary results show 
that the estimates of relative fre- 
quency are excellent, just as in 
Shuford’s experiments, but that the 
made among the bets are 
distorted. In short, at almost the 
same moment the Ss say that SP is 
and is not equal to OP. Something 
peculiar is clearly going on. 

My interpretation of this finding is 
that people perceive OPs correctly, 
but that they misuse them. They 
simply prefer to bet at some OPs 
rather than others, even though they 
perceive these OPs quite accurately. 
This hypothesis can be translated into 
mathematics by supposing that they 
attach to each event a weight w; which 
expresses the relative desirability or 
undesirability of the probability dis- 
played by that event. A model of 
this sort is only a minor variation on 
Equation 1: 


WSEU = >. (piwi)u; 


chose 


choices 


[10] 


Mathematically, the WSEU model 
is exactly the same as the NASEU 
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model (which is why this paper has 
the 
model) 


much attention to 
properties of the NASEU 
except that the WSEIL 
by its 


devoted so 


model comes 
addition operation naturally 
rather than arbitrarily. Since there 
is no mathematical difference between 
the NASEU and WSEU models, the 
difference between them is presum- 
ably one of esthetics; I find the WSEU 
model esthetically more appealing. 

I cannot resist speculating a bit 
about the nature of a WSEU model. 
1 assume, of course, that it applies to 
How many differ- 
ent sets of weights are necessary? In 


infinite event sets. 


the language used earlier in this paper, 
how many different pages are there in 
the SP book? 
able suggest the speculation that 
there may be exactly pages in 
that book, each page defined by a 


The data now avail- 


five 


class of possible payoff arrangements. 
In Class 1, all possible outcomes have 
In Class 
possible outcome (or 


utilities greater than zero. 
2, the worst 
outcomes, if there are several possible 
outcomes all with equal utility) has a 
utility of zero. In Class 3, at least one 
possible outcome has a positive utility 
and at least one possible outcome has 
a negative utility. In Class 4, the 
best possible outcome or outcomes has 
a utility of zero. And in Class 5, all 
possible outcomes have negative utili- 
ties. (Note that this classification is 
meaningful only because of ratio scale 
utilities.) 
based on Finding 3 of the probability 


This speculation is partly 
preference experiments already dis- 
cussed. 

Another speculation of some _ in- 
terest concerns possible limitations 
on the complete freedom in choice of 
weights in WSEU models (or in choice 


of SPs in NASEU 


extremely unlikely, for example, that 


models). It is 


anyone would have a set of weights or 
a SP function which Wp 
(or y) 0.9 both to the occurrence 


assigned 
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and to the nonoccurrence of an event 
A. Although people do not use addi- 
tive SP functions, at the same time 
they do not behave in as lawless a 
fashion as the discussion has so far 
suggested. The problem is to know 
what the actual limitations on addi- 
tivity are, and this problem is pri- 
marily empirical. Information can 
be gained by inspection of SP func- 
tions, by examination of the results of 
experiments in which Ss must com- 
bine probabilities, 
other ways 
which capture 
of approximate 


and perhaps in 
Several axioms 
parts of this 
additivity can be 
dreamed up; it is a question whether 
anything useful can be deduced from 
them (so far nothing has been) : 


also. 


idea 


only if 


1. ¥(A) > 06.5 if 
W(A) < 0.5. 


2. If A; is the i element of a parti- 


SA; = 1 


and 


tion of S, then < k where 
k is some fixed constant less than 1. 
3. If A; is the 7“ element of an 
n-fold partition of S, then |}> A; — 1 
< f(m), where f(m) is nondecreasing 
and has a maximum value or a limit. 


Note that these possible axioms are 
alternative formulations, not parts of 
Whether they or 
anything like them are of any use de- 
pends on what (if anything) they 
imply, and on the data. Axiom 3, 
which the extent of the 
failure of additivity depends on the 
number of elements in the partition 
of S, is particularly attractive to me 
because of 


the same structure. 


states that 


some unpublished data 
which suggest its usefulness. 

Of course, all the models considered 
in this paper are static. That means 
that they do not consider possible 
changes over time. Static models are 
presumably simpler and 
study than dynamic ones, which do 


easier to 
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consider changes in time. The usual 
real life decision situation, however, 
requires a man to make not one but a 
deci- 
of in- 


sequence of decisions. These 
sions are embedded in a flow 
coming formation. ‘They change the 
nature of the incoming information in 
two ways, by modifving the environ- 
generates it and by 


ment which 


modifying the relation between en- 
vironment and person which makes it 
available. Thus the crucial question 


for most real-life decisions concerns 
the relation between information flow 
But that is the 


problem of dynamic decision making, 


and decision making. 
which is not the subject of this paper. 


Pre 


Variance 


\llais 1953b) 
both before and after him have sug- 


herences 


1953a, and others 
gested that the variance of a bet may 
important than EV, 


objective or subjective, in determin- 


be more its 


ing preferences. This suggestion is 
intuitively appealing, but extremely 
hard to translate into an experiment. 
| published one attempt to establish 
the existence of variance preferences 
(Edwards, 1954d) in which the con- 
clusion was that they exist, but are 
small in size relative to probability 
Pruitt 
(1960) have some data which suggest 
that both 
preferences exist, and are important. 


preferences. Coombs and 


variance and’ skewness 
In any case, the existence of variance 
or skewness preferences is likely to be 
any like 
10, or with 
maximization model of the kind 
paper. Unfor- 
tunately, the Coombs and Pruitt data 


inconsistent with model 


Equations 1 or indeed 
any 
discussed in this 
were choices among two-outcome bets. 
variance, 
payoffs, probabilities, and expected 
are related by a network of 
equations which mean that there are 


For such bets, skewness, 


value all 


too few degrees of freedom to ascer- 
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tain with which among these quanti- 
ties preferences are in fact correlated. 
The only way I know to disentangle 
all these quantities empirically is to 
use bets with more than two possible 
both Coombs and | plan 
such experiments. 


outcomes; 


The conditions on slopes of indiffer- 
ence which were stated in 
Theorems 4 and 5 would in principle 
provide experimental tests for the 
reality of variance and skewness pref- 
erences, in the sense that if either is 
satisfied then variance and skewness 
preferences cannot have played a role 
from which the in- 
curves were constructed 
except they were reflected in 
utility and SP functions. 


curves 


in the choices 
difference 


as 


If, as seems likely, experiments re- 
veal that variance preferences exist, 
and are not reducible to utility and 
SP functions, what direction should 
theoretical endeavor take then? | 
don't know. One thing is clear: such 
a demonstration would be the final 
blow for the class of models with 
which most theorists of decision mak- 
ing have been concerned until now 
and with which this paper has dealt. 


SUMMARY 
Most contemporary decision theo- 
explain risky 
alternative courses of action by sup- 


ries choices among 
posing that the available course of 
action with the highest subjectively 
expected utility will be preferred to 
all Subjectively expected 
utility is calculated by finding the 
subjective value or utility of each 
possible outcome of the course of 
action, multiplying it by its subjec- 
tive probability of occurrence, and 
all 
The concept of 


others. 


summing these products over 
possible outcomes. 
subjective probability in this theory 


is quite confusing. 
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his paper shows that if the sub- 
jective probabilities of a set of events 
of which some one must happen and 
not more than one can happen must 
add up to a specified constant (usu- 
ally 
then 


taken as one for convenience), 
utility 
interval scale and subjective prob- 
ability on a ratio But if 
subjective probabilities do not have 
this additivity then both 
utility probability 
must be measured on a ratio scale. 


can be measured on an 
scale. 


property, 
and_ subjective 

If subjective probabilities do have 
the additivity property, if the deci- 
sion model in which they are used is 
in principle applicable to any con- 
and if the 


ceivable set of events, 


number of different subjective proba- 


bilities which may occur in conjunc- 
tion with a given objective probability 
is no more than denumerably infinite, 
then whenever objective probabilities 
defined subjective 


are probability 


must equal objective probability. 
This theorem and another mean that 
the concept of subjective probabilities 
which must add to one, and decision 
models based on that concept, cannot 
be very helpful in explaining decision 
making: a large amount of data indi- 
cates that subjective probability can- 
not always equal objective probability. 

One alternative is to use a decision 
model in which subjective probabili 
ties need not add up to any particular 
Such a model is possible ; 
One 


model, 


constant. 
its properties are explored. 
that in 
utility must have a true zero point. 
The only 
utility is 
notion is not new 


Any 


maximization hypothesis (with addi- 


property is such a 
reasonable zero point for 
this 
to decision theory. 


where you now are; 


form of the expected utility 


tive or nonadditive subjective proba- 
bilities) implies a strong relationship 
the 
curves for simple bets. 


indifference 
This relation- 


among slopes of 
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ship leads to an experimental test of 
the applicability of 
If the result of this test should 
be negative, further theories about 
decision making in risky situations 
should probably include the concept 
of variance preferences. 


models of this 


class. 
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\PPENDIX*! 


This appendix contains precise state 


ments of the assumptions and proofs of 
Theorems 2, 3, and 4 from the body of 


the article. The proofs of Theorems 1 

and 5 are contained in the text. 
Theorems 2 and 3 assume the existence 

of a_ probability with 


space S events 


article for credit footnotes 


identifying the formulators of each theorem 


Al See body of 
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A, B, C, +++, having objective probabili- 
ties P(A), p(B), p(C), «++; the measure p 
has the conventional properties. Asso- 
ciated with each Event A is an element 
r(A) R: r(A) is called the 
identifying characteristic of Event A. 
Two or more events (indeed, large classes 


of a space 


of events in the models discussed in this 


paper) may have the same identifying 
A function I(r, p) is 
defined for each r and p such that for 
some Event A, r(A) = rand p(A) = p. 
The subjective probability Y(A) of the 


Event A is then defined by the equation: 


characteristics. 


¥(A) =T[r(A), p(A)] 
Several assumptions, mostly of tech- 
importance only, are 
are stated here with little discus- 


nic al 
They 


necessary. 
sion; the important ones are discussed 
in the text. 
\xiom 1: p is completely additive. 
\xiom 2: S is nonatomic with respect 
to p. This means that if p(A) > 0, then 
there exists some Event B contained in A 
a subset of A) such that 0< p(B) <p(A). 
\xiom 3: For each r for which I(r, 0) 
is defined, ['(r, 0)=0, and lim ['(r, p) =0 
p-0 
uniformly in r. That is, given ¢ > 0 
there existsa6 > Osuch that T(r, p) < € 
whenever p < 6 and I(r, p) is defined. 
Axiom 4: W(A) is a_ probability 
Spec ifically: (a) For AC S, 
<= 1, (b) y(S) az |} and 
oO Es Be Sas 
then ¥(A U B) = Y(A) 


measure. 
0 Ss Y(A) 
y(S) = 0, and 
Af) B=0, 

+ ¥(B). 

There is some redundancy in the state- 
Axiom 4; it was included 
clarity and to permit Theorem 4 to be 
proved without the use of Part c of the 


ment of for 


axiom. 

Some lemmas prepare the way for the 
proof of Theorems 2 and 3. 

Lemma 1: The probability measure y 
is completely continuous with respect to 
p, and completely additive. 

From Axiom 3, it follows immediately 
that to any number e > O there corre- 
sponds a number 6>0 such that ¥(A) <e 
Event A such that p(A) <6 
This is what complete continuity means. 


for any 
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Now let Ai, Ag, A3, ae 


be a countable 


sequence of disjoint events such that 


Since p is completely addi- 


> p(A,) and 


n=1 


have 


p(A) oa 


20 


= U Ax) lim p(B,) = 0. It 


nH 


(setting B,, 


follows from the discussion 


that lim ¥(B,) = 0. 


foregoing 
Since p(A) exists, 
and since it follows from 


so does y A), 


n—1 


\xiom 4 thatw(A) = >> ¥(A,) + W(B,) 
k=l 


for all m, we conclude 


that w(A) 
(A,), and thus that W is com- 


pletely additive 
Lemma 2: 
to v. 
Proof: Lemma 1 makes it possible to 
apply the Radon-Nikodym theorem (see 
for example Halmos, 1950, p. 128) to 
infer that Y has a probability density 
with respect to p. If A is any event 
with ~(A) > 0, it must include a Sub- 
event A’ such that p(A’) > 0 and the 
density of ¥ with respect to p is positive 


S is nonatomic with respect 


on A’. Since S is nonatomic with respect 
to p by Axiom 2, there is an event 
BC A’ such that 0 < p(B) < p(A’);con- 
(B) <y(A’) SYA). 


This proves the lemma. 


sequently also 0 «< 


Lemma 3: Let f be a (Borel-measurable) 
function defined on S such that 0 S f(w) 


<= 1 for w in S, and let Q; and Q» be 


( omplet ly additive nonatomi prob 


ability measures. Then there exists an 


Event £ such that: 


J siceraay 

Ss 

fs w)dQs(w) 
8 


This result is a spec ialization of one 
established by Dvoretsky, Wald, and 
Wolfowitz (1951a, 1951b), representing 
an extension of an earlier theorem of 
Lyapounov (1940). 

It is convenient at this point to inter- 
pret the objective and subjective prob- 
abilities of an Event A as the rectangular 
coordinates of a point [p(A), ¥(A) Jina 
Then if p and y are 
not identical measures there must exist 
an Event H for which ~(//) ¥ p(#), 
and_the points (0,0), [p(H), ~(4)], 
(p(H), ¥(H)], and (1,1) (where His 
the complementary event to H) form a 
The following 
there is at least one event 


Euclidean plane. 


parallelogram. lemma 
asserts that 
corresponding to every point within the 
parallelogram. 

Lemma 4: Let H be 
let a and b be any two numbers such that 
OSaStandOSbZ1. Then there 
exists an Event C such that p(C) = ap(/7) 
+ bf 1 — p(H)] ¥(C) = af(H1) 
T b[ 1 - ¥(H)]. 

Proof: Let g be the characteristi 
function of H (that is. let g(w) be 1 for 
win H, and 0 Then by 
\xioms 1 2 Lemmas 1 2 


any event, and 


and 


otherwise). 
and 2 and and 2 
the hypotheses of Lemma 3 are satisfied 
with Q; = p, Q. = y, and f(w) = ag(w) 
+ b[1 — g(w)]. The 
the event whose existence is asserted by 
Lemma 3. 


Event C is then 


Now it is possible to prove 

Theorem 2: If there is an Event H for 
which ¥(H1) # p(H), then for any value 
of p within the limits 0 < P < 1 there 
exist Events A and B satisfying the 
relations p(A) = p, p(B) < p(A), ¥(B) 
>wW(A)+ Pil P) ¥(H) — p(A) 

Proof: It is sufficient to consider the 
case p(H) S 1/2, P S 1/2,W(H) — p(1) 
=h > 0; parallel arguments establish 
the result for other cases. 
first that 0< PS p(//). 
By Lemma 4 (with a = 0 and b = P 
[1 — p(H1)]) there exists an Event A 
such that p(A) = P, while yW(A) 
= Pi —y(/1))/[1 — p(A)]. Similarly, 
taking a = P/p(H), * =0 yields the 
conclusion that there exists an Event B’ 


with p(B’) = P, ¥(B’) = P(H)/p(D). 


Suppose 
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Subtraction yields the inequality: 
v(B’) — ¥(A) 


4 hP(1 — P) 

= —_——— >h _ 
p(A)[1 — p(A)] 
Next suppose p(//) < pS 1/2. The 
Event A may be selected as in the fore- 
going case, while an appropriate Event B’ 
corresponds tooa=1,b= [ P — p(H)], 
[1 — p(H)]. 


y(B’) —y(A) = (1) 
, P—p(f), 
: [i—y(/1)] 
1—p(A): VJ 
jl —y (1) h 
1—p(H) 1—p(H) 


Thus in this case also: 


-hP(1— P) 


Now 
sponding to the Event B’ be multiplied 
by the factor: 


let the values of a and b corre 


hP( 


P) + (A) 
y(B’) 


and designate an associated event by B. 
Since k < 1, it follows that: 


p(B) = kp(B’) < P = p(A) 


while 
y(B) =ky(B’) 
1/2[¥ (B’)+hAP(A—P)+y(A)] 
>hP(1—P)+y(A) 
Thus the Events A and B have the de- 
sired properties and the proof is complete. 
Theorem 2 could be strengthened, in 
that the expression hP(1 — P) could be 
replaced by function 


any continuous 


G(P) satisfying the inequalities: 
hP 

p(H)[1 — p(H)] 

0< PS min[p(H) 


G(P) 
1 — p(H)] 


h 
G(P) < - — 
: max [p(H), 1 — p(H)] 
min [p(//), 1 — p(A)] 


ars pb(H),1 — p(H)] 


max 


coun « 22 
“eS p(H)[1 — p(A)] 
max [p(H),1 —- p(XA)J SP <1 


Che proof is changed only by the re- 
placement of hP(1 — P), where it ap- 
pears, by G(P). 

Lemma 4 also makes it possible to draw 
a strong conclusion about the cardinality 
of the set of identifying characteristics. 

Theorem 3: If there exists an Event H 
for which ¥(H) # p(/1), the set R of 
identifying characteristics is nondenum- 
erably infinite. 

Proof: Let the 
theorem be satisfied for some H. 


hypothesis of the 
Sup- 
pose for convenience that: 
p(H) S 1/281 — p(A) 
Then 
ap(f) + bf 1 _ p(H)] = 1/2 


= [1/2 — ap(H)//[1 — p(A)] 


Since the foregoing relations imply that 
0Sb851if OSaZ1, Lemma 4 ap- 
plies, and for each value of a in the indi- 
cated range there exists an Event C, such 
that p(Ca) = 1/2, while: 


ae 1/201 —W(H1)]+aly(A) — p(A)] 
y Cg)= ——_ —_— ——_—_—_———————— 


1—p(H) 


Thus distinct values of a are associated 
distinct values of (Ca). 
b(C,a) is the same for all a, the identifying 
r(Ca) must likewise be 
distinct for distinct a’s. Thus a subset 
into one-to-one corre- 


with Since 


characteristics 


of R can be put 
spondence with the unit interval, and 
the theorem is established. 

Che general mathematical setup under- 
lying Theorem 4 is explained in the body 
of the paper; it is not altogether the 
same as that underlying Theorems 2 and 
3. Theorem 4 is concerned with bets of 
the form “with probability p you win 
$x; with probability 1— p no money 
Specifically, it as- 
sumes the existence of indifference curves 


changes hands.” 





Warp Epwarps 


bets, and inquires 


NASEI 


in a plane of such 


about the restrictions which a 

model puts on such curves. 
\xioms 1 and 2 of this Appendix still 

Parts 


\xiom 4 are assumed (but not 


apply, Axiom 3 is irrelevant, and 
a and 5b of 
Partc). The following additional axioms 
are assumed. 

Axiom 5: For the particular situations 
in which Theorems 4 
tested, y p) is a 
able, monotonically 
tion of p. 

Axiom 6 


in which 


and 5 are to be 


continuous, differenti 
increasing fun 

For the particular situations 
Theorems 4 and 5 are to be 
tested, u(x) is a continuous, differenti 


able, monotonically increasing func 
tion of 

asserted by 
That is, 


A is preferred to 


Axiom 7 The model 
Equation 1 of the text applies. 
in this situation Bet 


Bet B if and only if: 


y pa)u v4) 


Now we perform the construction de 


scribed in the paper and illustrated in 


Figure 1. If Sy 
difference curve passing through Bet A, 


is the slope of the in 


we Can prove 
Theore m 4 a 4.¢ SOD 


that if the in 
transformed 


First 
difference 


comes a prool 
curves can be 
into a form fitting Equation 4 of the 
half of Theorem 4 


If Equation 4 can be written 


paper, then the first 
must hold 
there must 


that 


for any indifference curve, 


exist two functions g and hk such 
the equation for each indifference curve 


can be written 


\1] 
Differentiating Equation 11 gives 


p)d 


dx 


which simply says that the slope of the 


indifference curve at any point may be 


written as a product of a function of x 
along and a function of p alone. Now 
it is tautologous that: 


P(x1)O(p2) — B(x2)O(p2) 
P\ ¥1)0(p.) i P (x2)0(p1) 


Substitution from Equation A3 reduces 
I quation A4 to: 


S(x1, pe) ; (Xs, py) \5] 
S(x1, pi) S(x2, pi) ’ 


But by definition (x;, pe) defines Bet A, 
vo, po) defines Bet B, (xe, pi) defines 
Bet C, and (x, p;) defines Bet D. So 
Equation A5 may be rewritten: 
SaSc = SpSp [A6] 
The proof so far has shown that if 
Equation 4 is correct then Equation A6 
follows. In short, Equation A6 is neces 
sary for the correctness of Equation 4, 
\xiom 7. 


and so of It remains to show 


that it is also sufficient. The statement 
of Theorem 4 given in the text of the 
that Equation A6 is 


well as 


paper specifies 


sufficient as necessary for the 
correctness of Equation 4 only if at least 
indifference negative 


one curve has a 


slope; this restriction is necessary to rule 
out transformations on p and x which 
are monotonically decreasing rather than 
monotonically increasing. To prove 
sufficiency given at least one indifference 
curve with a negative slope, it is neces- 
sary to exhibit transformations which 
actually fit Equation 4. 

To start, 


xo > 0,0 < po < 1. 


choose any point 


Now let 


(Xo, Po), 


= Prt) — 
( CNP 


yy =expS pb [A8] 


Yo. Po 


I 
d 
Po (xo, P) 


Now it is that 
these transformations fit Equation 4 if 
they fit Equation A6. 

First, 
and A&; 


necessary to prove 


differentiate Equations A7 
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Substituting into \9 from A6 


Wey 


du u S(x, p)S(xo, po) dx 


dy - vy ; 1 dp 


. zx di = Si Oy 


d 
X [—S(x, po)dx] But - = S(x, p) by definition. There- 


dy v path ; 
fore, = - which is the differential 
du u 
dy a v 5 (x0, Po) dp \97 equation for the family of rectangular 
du u S(x, po)S(xo, p) dx *  hyperbolas. 
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DECISION MAKING: 


OBJECTIVE MEASURES OF SUBJECTIVE 
PROBABILITY AND UTILITY! 


GORDON M 


11 Dynamics Corp 

This paper describes several meth- 
ods for measuring subjective proba- 
bility. 
a priori measures of 
measure both utility 
tive probability simultaneously. 


Some of the methods require 
utility, 
and 


others 
subjec- 
The 
be- 
the 


(S's) conscious opinions or 


choice 
reflect 


measures, derived from 
havior, do not necessarily, 
subject’s 
feelings. The subjec tive probability 
estimates may be useful as measures 


of confidence, bias, prejudice, or 
expectancy. 

The the 
modification of Edwards (1955) Sub- 


jective Expected Utility (SEU) model. 


basis for methods is a 


It is assumed that :* 

1. A 
choice between two or more alterna- 
the alternative that 
appears, to him, to offer the highest 
average satisfaction (SEU). The 
SEU of an alternative, A, is defined as: 


person confronted with a 


tives selects 


I 
SEU (A) = &¥ ¥(A,)u(A,) 
1 


1 The major ideas discussed in the first half 


of this paper were presented at the Thirteenth 
Annual Meeting of the Eastern Psychological 
\tlantic City, April 3, 1959. 
Chis investigation was supported in part by 
Research Grant M-2348(RI) from the 
National Institute of Mental Health, United 
States Public Health Service, in part by the 
General Dynamics Corporation, and in part 
by Itek Corporation 

2 Now located at TEMPO, General Elec- 
tric Company. 

3 The author is indebted to Ward Edwards 
for his help in making these assumptions ex- 
plicit, for encouraging the author to pursue 
these ideas, and for suggesting that the author 
consider the effects of various degrees of in- 


Association, 


formation about utility upon the determina- 


tion of subjective probability. 


BECKER? 


ation, Groton, Connecticut 


where: 

A; is the ith possible outcome of course of 
action A;i = 1,2, ---, J. 

y (A;) is S’s subjective probability that Out- 
come A; will occur. 

u(A;) is the satisfaction (utility 
to derive if Outcome A 
takes course of action A 


S expects 


occurs when S 


2. The sum of the subjective proba- 
bilities of a set of mutually exclusive 
and exhaustive events equals unity,‘ 
1.e. : 


3. Utility and = subjective 
bility do not interact. 
4. The satisfaction (utility) S asso- 


ciates with an event is measurable. 


proba- 


5. The measures of S’s utility reflect 
the utilities the 
subjective measurement 


operating during 
probability 
the 
changed during the period intervening 
between the utility and the subjective 
probability measurement sessions. 


session, 1.€., utilities have not 


In addition to the above, the as- 
the 
used to measure S’s utilities must be 
added. If the utility are 
based the that 
utility differences of all magnitudes 
are discernable (Coombs & Komorita, 
1958; Edwards, 1955; Luce & Raiffa, 
1957; Savage, 1954; Siegel, 1956), i.e., 
that any difference in SEU leads to 
100°) preference for the alternative 
with the higher SEU, this assumption 
becomes part of the present model. 


sumptions inherent in method 


measures 


upon assumption 


‘This is a modification of Edwards SEI 


model in that he did not require that the sum 
of subjective probabilities equal unity 
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However, if the utility measures are 
based upon the assumption § that 
preferences are stochastic when utility 
differences are small (Coombs, 1959; 
Davidson & Marschak, 1959; David- 
son, Suppes, & Siegel, 1957; Luce, 
1959; Suppes & Walsh, 1959), then 
the stochastic assumptions become 
part of the present model. 

Figure 1 shows the SEUs of two 
alternatives Y; and Y2 as functions of 
subjective probability. The figure 
illustrates a situation in which you, 
as S, must choose between going to 
the airport (Y,) or going to the Green 
(Y.). In this example you 
want to meet Bill, who will be either 
(B,) at the airport, or (B.) at the 
The satisfactions (utili- 
to derive from the 
four possible outcomes of this situa- 
tion are given by $41, 12, @e1, and do». 

The model predicts that you will 
choose Y, (the airport) whenever your 
subjective probability that Bill is at 
the airport, ¥(B,), is greater than X). 
Note that X, is the subjective proba- 
bility at which the two alternatives 
have equal SEU and intersect. (Ifa 
stochastic model is used to determine 
utilities, Y, will be a range rather than 
Note that the 
of the subjective probabilities, ¥(B,) 
+ y(Bz), is unity, indicating that you 
are sure Bill will be either at the air- 
port or the Green Room, not at both, 


Room 


Green Room. 


ties) you expect 


a point.) also sum 


and not somewhere else. 


MEASURING SUBJECT PROBABILITY 


The subjective probability method 


proposed in this paper is based upon 
the reverse of the above interpreta- 
instead of predicting the 


tion, 1.e., 
the method 
estimates S’s subjective probability 


alternative S chooses, 


at the time of choice. From Figure 1, 
evident that 
Alternative Y,, his subjective proba- 
bility ¥(B,), must be between X, and 


it is when S chooses 





BILL_CHOOSES B 
AIRPORT (BI) |GREEN Rm(62) 


oC!) ¢ (12) 
GREEN Rm (Y2) (2!) $(22) 
9(22)> II> P2I)> oI2) 





YOU CHOOSE Y 





AIRPORT (YI) 




















WHERE 





S(B!) 100 
S(B2) 0.00 


Fic. 1. Hypothetical problem illustrating 


SEU model. 


unity, whereas when S chooses }2, 
y¥(B;) must be between X, and 0. 
Knowledge of both the utilities and 
the choice permits one to place upper 
and lower bounds on W(B)). 

The range between the upper and 
lower bounds can be decreased by 
having S choose from among several 
alternatives. In general, if N differ- 
ent utilities are known, it is possible 
to construct N? different two-outcome 
alternatives. 

Let (a;,P;a,;) represent an alter- 
native, A, that pays a; when p 
occurs and a; otherwise. If u(a;) 
> u(ds) > > u(a,) it follows that 
(a;, P; a) is preferred to all other 
alternatives at all probabilities except 
¥(p) = 1 at which point all alterna- 
tives of the form (a;, p; x) are equally 
preferred and except at the point 
¥(p) = 0 when all alternatives of the 
form (x, P; 4a.) are equally preferred. 
Alternative (a), P; da») is preferred to 
all alternatives of the form (a;, p; x) 
when u(x) < u(d2) and is preferred 
to all alternatives of the form (x, p; a2) 
u(a,). However, there 
are N 1 alternatives of the form 
a,;) which preferred to 
(a1, P; dz) when y(p) is small, but 
are not preferred when y(p) is large. 


when u(x) < 


(%, 2 are 
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A (1) 
<= {Ov (It) 
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| ~" 
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GF EH 
SUBJECTIVE PROBABILITY OF X 
Fic. 2. Example illustrating relationship 
between SEU, subjective probability, and 
alternative choices. 

















1.0 0.0 


Hence, these NV — 1 alternatives inter- 
(a1, P;@2) at various ¥(p). In 
general, alternatives of the 
(a;, P; @;) intersect N — 7 alternatives 
of the form (x, p;y), when u(x) 
< u(a;) and u(y) > u(a;). Figure 2 
shows the nine two-outcome alterna- 
tives that can be formed from three 


sect 
form 


distinct utilities and the nine subjec- 


tive probability intersection points 
that result. In general, N 
utilities will result in NV? two-outcome 


alternatives and N?(N — 1)?/4 inter- 


distinct 


sections. 
Let the subjective probability at 


AVAILABLE PAYOFFS 


SUBJ ECTIVE 
EXPECTED 
UTILITIES 

OF 

PAYOFFS 

AND 
ALTERNATIVES 





+ 


MAXIMUM NUMBER OF OBTAINABLE 
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the point at which Alternative A inter- 
sects Alternative B be y(A, B) 
where A = (a, p; a’) and B= (8, p; b’). 
Since the SEUs of A and B are equal 
at the intersection point, it follows 
that: 


y(p)u(a) + [1 — ¥(p) Ju(a’) 
y(p)u(b) + [1 — wp) Ju(o’) 


and: 


u(a’)—u(b’) 


¥(A, B) u(b)—u(b’) —u(a)+u(a’) 

It isevident that y(1, B)=y(B, A), 
i.e., that reversing the order of the 
alternatives does not change the sub- 
jective probability at the intersection 
point. 

However, reversing the 
(utilities) within an alternative will 
change the crossover point. Let 
Alternative A’ be (a’, p;a@) and let 
Alternative B’ be (b’, p; 6). It follows 
that: 


payotts 


u(a)—u(b) 


, a — 
¥(A’, B’) u(b’)—u(b) —u(a’)+u(a) 


6 _ 

= 

6 

= : er - 
SUBJECTIVE PROBABILITY INTERVALS 





Fic. 3. 


Che number of obtainable subjective probability intervals as a function of the number 


of required choices and the number of available payoffs. 
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Comparing ¥(C1’, BY) with ¥C1, B) 
it will be that C1’, B’) 1 

W(A, B). 

It is also evident, from the above 
equations, that y(A, 1’) £/2, 405, 
reversing the payoffs within an alter- 
native will the 
alternative intersecting the original at 
¥(p) 1/2. In general, N(N — 1)/2 
of the intersections of the two-out- 
come alternatives formed from WN 
unique utilities will intersect at y(p) 

1/2. One need know only the 
ranks of the two utilities used in these 


seen 


result in reversed 


alternatives to determine which alter- 
native has the higher SEU at subjec- 


tive probabilities greater (or less) 
than 1/2. 

The maximum number of unique 
intersections of two-outcome alterna- 
that from N 


unique utilities is therefore: 


(N? — 1)(N? — 2N) + 4 
4 


tives can be achieved 


The maximum number of subjective 
be de- 


probability intervals that can 
termined from N unique utility points 


is one more than the number of unique 


intersections, as shown in Figure 3. 


Utilities Measured on Interval or Ratio 
Scales 


the 
operating at 


for 

subjective probability 
the time of S’s choice is to have S 
select one of several alternatives. For 
example, if S must choose between 
Alternatives £, D, and F 
Figure 2 and he chooses Alternative EF, 


One method estimating 


shown in 


his subjective probability, according 
to the model, must be between ED 
and unity. Whenever S selects Alter- 
native /, his subjective probability 
must be between DF and 0; and when 
S selects Alternative D, his subjective 
probability must be between ED and 
DF. If the utilities, (1), (11), and 
@(II1) are measured on a ratio or in- 


MAKING 


terval scale, the numerical values of 


bounds can be deter- 
the 


these probabilit 


mined from the equations for 
intersections of the alternatives. 

Phe range between the upper and 
lower bounds can be reduced by hav- 
S again choose between alterna- 


EB, 


might be pre- 


ing 
tives. If S's original 
Alternatives EF and C 
Given the initial preference 


choice is 


sented. 
E, if S now selects E over C his sub- 
jective probability must be between 
unity EC, 
C at this point would indicate a sub- 


and whereas a choice for 
jective probability between EC and 
ED. Similarly, Alternatives ~ and F 
might the original 
choice were D, and Alternatives B and 
F might be presented if the original 
Given three utilities, 


be presented if 


choice were F. 
it is possible, from two responses, to 
estimate which of six intervals con- 
tains S’s subjective probability. A 
third choice would permit the experi- 
menter to determine which of eight 
subjective probability ranges contains 
S's subjective probability. 

Another technique for 
increased precision requires only a 
S ranks his prefer- 
alternatives instead 


obtaining 


single response. 
ences for many 
of simply selecting the one he prefers 
most. Figure 3 shows the maximum 
number of intervals obtainable with 
from one to six utility points. Notice 
that the number of intervals depends 
upon the depth of the ranking re- 
the number of 


quired of S and 


utility points used. 


Utilities Measured on an Ordinal Scale 


If only the ranks of the utilities are 
known, at least 2N — 2 ranked proba- 
bility intervals can be obtained. As 
indicated earlier, knowledge of the 
ranks of two utilities is sufficient in- 
formation to enable the experimenter 
to construct a pair of alternatives 
such that S’s choice indicates whether 
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his subjective probability is 
than, equal to, or than 1/2. 
Alternatives B and C, and E and F, 
and G and // (Figure 2) are examples 
of such alternatives. 

In order to obtain 2N — 2 ranked 
probability from N different utilities 


greater 


less 


when only the ranks of the utilities 
are known, construct a set of 2N 3 
two-outcome alternatives. Let 
alternative, AN, winning 
the highest utility, a, when Event p 
occurs, and the lowest utility, 2, 
otherwise, 1.e., 4. N (a,p;n). From 
each of the other N — 2 utilities con- 
Let 
alternative from each utility be a sure 
thing, and let the other alternative 
the next 
utility when p does not occur, e.g., 
from Utility 7 construct Alternatives 
II and JJ where J] (1, p;t) and 
IJ (i, p; 7) and where the rank of 
u(t) is one higher than the rank of 
u(j). It that Alternative 
II intersects AN at a higher sub- 
jective probability than the inter- 
section of AN and JJ, i.e., Y(AN, 11) 
> W(AN,JJ). Moreover, it is evi- 
dent that y¥(AN, BB) > W(AN, BC) 
> ¥(AN, CC) > > W(AN, MM) 
where u(a) > u(b) > --- > 
> u(n). Two additional alternatives 
can be formed to complete the set of 
2N — 2 alternatives. One of 
alternatives, BA (b, p; a), will in- 
tersect .1N at the highest subjective 


one 


consist of 


struct two alternatives. one 


consist of winning lower 


is evident 


u(m) 


these 


probability, and the other alternative, 
NM n, P; m) will intersect AN at 
the subjective 
Therefore, the 2N — 3 
this manner intersect 


lowest probability. 
alternatives 
AN 


3 different subjective proba- 


formed in 
at 2N - 
bilities. The ranks of the subjective 
probabilities at the intersection points 
are strictly determined by the ranks 
from N differ- 
ent utilities it is possible to construct 


a set of alternatives such that 2N — 2 


of the utilities. Hence 


subjective probability intervals can 
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be ranked from a knowledge of only 
the utility ranks. 

A second technique for measuring 
subjective probability when only the 
utility ranks known makes it 
possible to measure both utility and 
subjective probability simultaneously. 
Let the N utilities be ranked as be- 
u(a) > u(b) > --+ > un). 
Construct the following N two-out- 
come alternatives: AN (a, Pp; n); 
BM = (b, p;m); CL = (c, p; 1); -:-; 
NA = (n, p;a). These N alternatives 
intersect, at most, at (N?—2N-+2)/2 
distinct probability points to form 
(N?—2N+4)/2 probability intervals. 
Consider the four 
ranked a highest and d lowest. 
four two-outcome 


are 


fore: 


utilities, 

Let the 
alternatives, 
structed in accordance with the above 
plan, be: A (a, p;d);B (b, p;c); 
C = (c,p;b);and D = (d, p;a). The 
difference between Utilities @ and }, 
i.e., u(a) — u(b), must be less than, 
equal to, or greater than the difference 
between Utilities cand d. But when 
i(a) — u(b) < u(c) — u(d), the ranks 
of the SEUs of the Alternatives A, B, 
C, and D are: 


case of 


con- 


D>C>B>A when 
D=C>B>A when 
C>D>B>A when 
C>D=B>A when 
C>B>D>A when 
C= B>D=A when 
B>C>A>D when 
B>C=A>D when 
B>A>C>D when 
A=B>C>D when 
A>B>C>D when y (A, B)< 


Note that only 8 of the 24 possible 
rankings of A, B, C, D occur when 
u(a) — u(b) u(c) — u(d). 

When u(a) — u(b) u(c) 
the SEU ranks are: 


—_ u(d), 


D>C>B>A when 0 <y¥(p)<1 
A=B=C=D when ¥(p) =1 
A>B>C>D when 1/2 <y(p) <1 


) 
) 


Note that all 24 rankings of A, B, C, D 
are possible when u(a) — (db) u(c) 
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u(d), but only two rankings occur 


when ¥(p) is different from 1/2. 


When x(a) u(b) > 
the SEU ranks are: 


D>C>B>A when 0 
D>C>B=A when 
D>C>A>B whe 
D>C=A>B when 
D>A>C>B whe 
D=A>C=B whe 


u(c) u(d), 


«.<x«.n. ce 


.= 


when 


p> oat 


when 


i 


>B> C=D when 
>D when ¥(C, D 


._— 


A> 

A>B=D>C 
A>B>D>C when 
A> 

A> 


> B> 


a 


Note that only 8 of the possible 24 
rankings of 4, B, C, D occur when 
ula) u(b) > u(c) — u(d). 

From the above, it is evident that, 
when y(p) is different from 1/2, six 
rankings of the 
only when u (a) 


alternatives occur 
u(b) > u(c) — u(d) 
and six other rankings occur only 
u(c)—u(d). Thus 
if S assigns any of these 12 distinctive 
rankings when y(p) is not 1/2, the 
experimenter can immediately rank 
the subjective probability of p operat- 


when u(a)—1(b) « 


ing at that time and, at the same time, 
whether 


u(a) 
greater or less than u(c) u(d). If 
than 


determine u(b) is 


u(b) is (less) 
u(d ), 
never 


u(a) greater 
10 of the 24 rankings 
and if the utility 
differences are equal then 22 of the 24 
rankings will never occur when y(p) 
is different from 1/2. 
subjective probabilities different from 


Uc) 


will occur, 


Therefore when 


one-half are used, the experimenter 
from S 


u(b) is 


determine 
u(a) 

greater than or less than u(c) 
the 
intervals 


can immediately 
rankings, whether 
u(d), 
can immediately order six sub- 
jective probability 
able from the four utilities, and can 


which of 


obtain- 
simultaneously determine 
the six intervals contains the (p) 
operative at the time of S’s ranking. 

Thus, given only the ranks of the 
utilities, it is possible to order the 
utility differences and the subjective 
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probability intervals from S's rank- 
ings of the alternatives. 

llowever, the above method does 
lf S 
ranks the alternatives consistent oniy 
with u(a) — u(b) > u(c) — u(d) at 
one time, and consistent only with 
u(a) — u(b) < u(c) — u(d) at another 
the method will lead to con- 


tradictory results. If people show 


not allow for stochastic choice. 


time, 


such inconsistencies in their rankings 
it will be necessary to modify the 
method by extending it to include 
stochastic rankings of alternatives. 


PRELIMINARY TEST OF 
GENERAL METHOD 


THE 


Two preliminary experiments were 
conducted to explore whether or not 
Ss vary their selection of alternatives 
in accordance with the difficulty of a 
discrimination task and the objective 
probability of an event. A_ visual 
discrimination task was selected be- 
cause it did not contain any explicit 
probability measures, because it pro- 
vided a fairly critical test of the 
method, and because it permitted 
unambiguous control of the stimulus. 
Both experiments were conducted at 
Connecticut College® using the same 
23 undergraduate women. 

Each S was given a set of 45 prob- 
lems from which her utility scale for 
points could be determined by linear 
programing (Davidson). A problem 
consisted of a between two 
alternatives. Each alternative con- 
sisted of winning 1500, 1000, 500, or 
0 points or of losing 500 or 1000 
points if Event ZEJ occurred; and 
winning or losing one of the above 
point scores if Event ZOJ occurred. 
The subjective probability of ZEJ 
(ZOJ) was assumed to be equal to 1/2. 
Linear programing solution could not 


choice 


® The assistance of Mortimer H. Appelzweig 
and George O. Moeller in providing Connecti- 
cut College Laboratory facilities and students 
is gratefully acknowledged. 
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TABLE 1 


Deriven | 


PILITY 


SCALES 


Objective Point Offers 


1000 500 


OM 
\G 
MS 
JN 
BW 
GH 
K ry 


CS 


1875 
1061 
2250 
1500 
1333 


1000 


1000 
643 
1000 
&33 
667 
750 
667 
500 
500 
700 


1333 
1000 
1250 
| Jj 1300 


' 
all utility scales were 


units of all utility scales were arbitrarily set by 


* The zero points of 


utilities of zero and 500 objective points 


The scales for 
10 of the Ss for whom solutions were 
found Table 1. AI- 
though the subjective distances be- 
tween the point offers differed from 
the objective distances for most Ss 
the rank-order of the points and the 
rank-order of the probabilities at 
which alternatives were preferred did 
not differ from the objective for the 
the 


be found for many Ss. 


are shown in 


alternatives used in two ex- 


periments. 


Experiment I: Visual Discrimination 


In the first experiment, S chose one 
the 
to discriminate which of 


of 


her ability 


five alternatives on basis of 
two line clusters contained the greater 
Unknown to S, 
of the two clusters always contained 
100 lines. The other contained 75, 80, 
85, 90, 95, 100, 105, 110, 115, 120, or 
125 lines. The lines were 1/8 to 1/4 
inch long and were randomly 
tributed in an irregular, egg shaped 
pattern about 3/4 inch high and 1% 
inches long. Both clusters were of 
the same irregular shape and area. 
Sixty pairs of these clusters were re- 


number of lines. one 


dis- 


produced in mimeographed test book- 


arbitrarily 


500 1000 1500 


500 
500 
500 
500 
500 
500 
500 
500 
500 
500 


1375 
928 
1000 
1000 
750 
750 
833 
875 
750 


800 


1625 
1142 
1250 
1667 
1083 
1500 
1333 
1125 
1250 
1200 


as 


7 i ee) 


The 


issigning 500 units to the distance between the 


set at zero objective points. 


Each booklet contained the 
same 10 pages with six problem pairs 
per page, but the order of the pages 
was randomized from one S to the 
next. The top cluster in each problem 
pair was labeled QUG and the bottom 
cluster was labeled QOB. Two of the 
QUG areas contained 75 lines, two 
contained 80, 85, 115, 120, and 125 
Four of the QUG areas con- 
tained 90, 95, 105, 110 lines. 
Each of the 28 QUG areas was paired 
with a QOB area containing 100 lines. 
An additional 28 problems reversed 
the QOB and QUG Four 
other problems were also presented 
in which QUG and QOB each con- 
tained 100 lines. 

S was told that the test was de- 
signed to test her ability to estimate 
the number of objects in an area. 
The instructions informed S that she 
would be paid 1 cent for every 500 
points she won, and that she would 
lose 1 cent for every 500 points she 


lets. 


lines. 
and 


areas. 


lost. However, S was guaranteed to 
net at least 1 dollar regardless of her 
She 


properly, win up to 2 dollars. 


losses. could, if she selected 


S won 
points by choosing one of the five 
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alternatives described below for each 
pair of QUG and QOB areas. 


Alternative CIJ: You win 1500 points if 
there are more lines in area QUG than there 
are in area QOB, and you lose 1000 points if 
there are more lines in area QOB. 

Alternative MEQ: You win 1000 points if 
there are more lines in area QUG than in 
QOB, and you lose 0 points if there are more 
lines in QOB. 

Alternative ZEJ: You win 500 points for 
sure. 

Alternative XIP: You win 1000 points if 
there are more lines in area QOB than in 
QUG, and you lose 0 points if there are more 
lines in QUG. 

Alternative FEH: You win 1500 points if 
there are more lines in area QOB than in 
QUG, and you lose 1000 points if there are 
more lines in QUG. 
alternatives shown in 
Table 2, together with the alternatives 
offered in the next two experiments. 

Two lines were drawn to the right of 
each problem in the test booklet. 
One line was labeled Side A, and the 
other was labeled Side B. S wrote the 
name of the alternative she preferred 
If she wished, 


These are 


on* the Side A line. 
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wrote the same name the 
Side B line, or she wrote the name of a 
different alternative. S was told that 
only one of the alternatives indicated 
on the two lines would be considered 
in computing her score. This alterna- 
tive was determined by flipping a coin 
labeled Side A on one side, and 
Side B on the other. If S wrote the 
both lines, she was 
sure to have that alternative count 
her score. If she wrote 
different names on the two lines, one 
or the other would be selected, each 
with a 50/50 chance. This procedure 
distinguished between real preferences 
for an alternative and those forced by 
a choice between equally desirable 


she on 


Same name on 


towards 


alternatives. This method provides 
a seven-interval probability scale as 
shown in Table 3. 

In to S's 


utility for points during the experi- 


order avoid changing 
ment, S was not told which area con- 
tained more lines, which side of the 


coin appeared face up, or her point 


rABLE 2 


ALTERNATIVES OFFERED THI 


Condition 
a 


S chooses column alternative 


Discrimination 
S receives 


If bottom area contains more 


lines, S receives 


| S chooses column alternati 


First set 


If ZEJ card is drawn, 


Probability Deck If ZOJ card is drawn, 


S chooses column alternative. 


Second Set 


If XIH card is drawn, 


Deck 


Probability 


SUBJECT 


If top area contains more lines, 


| If XEH card is drawn, 


BY PROBLEM AND CONDITION 


Alternatives 


CIJ MEQ | ZEJ FEH 


1500 | 1000 | 500 — 1000 





1000 | 500 | 1000 1500 











| 


VUK | WUB XEH YOF 


S receives 1500 1000 500 1000 


S receives 1000 500 | 1000 1500 


QIF ZBJ DJQ 


S receives 1000 1000 1500 


S$ receives 1500} 1000 0 1000 
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score until she had completed all 
three phases of the experiments. The 
utility scale, the discrimination and 
the probability deck problems, dis- 
cussed in the next section, were all 
administered during one group session 
that lasted 1144-2 hours. 

If the SEU model and the method 
for measuring subjective probability 
are appropriate, we expect S to choose 
a high probability alternative (CI] 
CIJ) whenever she is sure that QUG 
contains more lines, to choose FEH 
kEH whenever she is sure QUG con- 
tains less lines, and to vary her choice 
in the seven intermediate steps shown 
in Table 3 as her confidence in her 


Ri SPONSES 


ability to discriminate varies. One 


also expects a high correlation between 
S’s confidence, as measured by the 


subjective probability associated with 
her choice, and the difficulty of the 
discrimination task. Since the ranks 


DISCRIMINATION 


of the subjective probability levels 
do not differ from the objective prob- 
ability levels, for the 10 Ss for whom 
utility scales were obtained (Table 3), 
a high correlation would be expected 
between the ranks of the objective 


r 
Y 
oO 

> 


ASSOCIATED WITH 


probabilities associated with alterna- 
tives selected by S and the difficulty 
of the discrimination task. The mag- 
nitude of the difference between the 
number of lines in the two areas pro- 
vides an objective basis for ranking 
the difficulty of the discrimination. 
Thus, if a significant correlation be- 
tween the ranks of the stimulus differ- 
ences and the ranks of the probability 
intervals associated with the alterna- 
tives is found, it would support the 
notion of using the proposed method 
to measure subjective probability, 


PROBABILITIES 


~ 


or MEQ 


confidence in decisions, and stimulus 
ambiguity or difficulty. 


Response 


Results. Two Kendall coefficients 
of concordance were calculated for 


each S, one for four replications of the 


FEH or NIP 


five problems involving 100, 105, and 
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110 lines in QUG and 100, 105, and 
110 lines in QOB; and the other for 
the four replications of the five prob- 
lems involving 90, 95, and 100 lines 
in QUG and QOB. 
Spearman average rank 
tions obtained from these coefficients 
are shown, for each S, in Table 4. 
The ‘“‘best’’ estimates of each S's 
choice, on each problem, obtained by 
averaging the S’s four 
each problem, were used to compute 
a coefficient of concordance for the 
group as a The 
Spearman rank correlation obtained 
from the coefficient is labeled Total in 
Table 4. Note that 29 of the 46 
correlations are significant beyond the 


correla- 


choices on 


whole. average 


.0O5 level and that the totals are sig- 
nificant beyond the .01 level. 


EXPERIMENT I]: OBJECTIVI 
PROBABILITY 


The results of a second pair of tests, 
which employed explicit probabilities 
rather than line areas, are also shown 
in Table 4. S was told that one card 
would be drawn from a deck contain- 
ing 100 cards. Each card in the deck 
was labeled either ZEJ or ZOJ in the 
first set, and either XIH or XEH in 
the second set. S was told exactly 
how many ZEJ (or XIH) cards were 
in a particular deck. In the first set, 
she selected one of the four ‘‘Prob- 
ability Deck” alternatives shown in 
Table 2 and described below: 


VUK: You win 1500 points if ZEJ is drawn, 
and you lose 1000 points if ZOJ is drawn 
WUB: You win 1000 points if ZEJ is drawn 
and you win 500 points if ZOJ is drawn. 
XEH: You win 500 points if ZEJ is drawn 
and you win 1000 points if ZOJ is drawn. 
YOF: You lose 1000 points if ZEJ is drawn 
and you win 1500 points if ZOJ is drawn. 


The objective probabilities at which 
these alternatives intersect are shown 
in Table 5. In the second set, she 


rABLE 4 


CORRELATIONS BETWEEN OBJECTIV! 
STIMULUS SCALE AND SUBJECTIVE 
PROBABILITY 





Task 





Probability 


Discrimination 
5 | (4 alternatives) 


(5 alternatives) 
yect 


j 
| 
One area | One area 
contains contains 
more less 
than 100 than 100 
(100-105-110)) (90-95-—100) 


38 .64%* | .78** | 
DS a “- 
.66** ar 
55* .83** 
.O8 .83** 
61" _— 
.20 .68** 
_59** .84** 
5" .86** 
30 a" 
_9Q** 89** 
ws aes ao” 
.84** 

5Q** O1** 

A7* a 
‘ 39 | SOP? | 
i=" 18 | .89** | 


a | wa et © 
10 | .39 88** 
31° | oy" .85"° 
.46* — .84** 
.83** .53* .87** 
10 53” a 





.75** 


.834** ii" 





87** |. 
| 


* Significant at .05 level. 
** Significant at .01 level. 


selected one of the four alternatives, 
shown in Table 2 and described below : 


QO|F: You lose 1000 points if XTH is drawn 
and you win 1500 points if XNEH is drawn. 
FH]: You win 0 points if XIH is drawn 
and you win 1000 points if XEH is drawn. 
ZB]: You win 1000 points if XIH is drawn 
and you win 0 points if XEH is drawn. 
DJQ: You win 1500 points if XIH is drawn 
and you lose 1000 points if XEH is drawn 


The probabilities at which these alter- 
natives intersect are shown in Table 5. 
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Kach set of problems was con- 
structed from 19 
These decks contained the following 
numbers of ZEJ-ZOJ(XIH-XEH) 
cards : 0-100, 10—90, 15-85, 20-80, -- -, 
75-25, 80-20, 85-15, 90-10, 100-0. 
Thirty problems were constructed 
from each set of 19 decks by present- 
ing 11 decks twice and 8 decks® once. 
Each set of 30 problems was included 
separate page in the test 
booklet after the discrimination ex- 
periment problems and the _ utility 
scale problems. 

S again indicated the one, or two, 
alternatives she preferred, using the 
same Side A and Side B procedure. 
She was not informed of her point 
score on either set, nor of her point 


different decks. 


On a 


score on the discrimination or utility 
problems during the experiment. 
Results. Wendall tau correlations 
between the objective probability 
ranks of the decks and the ranks of 
the alternatives selected by S, assum- 
ing S is rational, were computed for 
each S on each of the two problem 
Table 


4. The correlations labeled Total are 


sets. The results are shown in 
average Spearman rank correlations 
obtained from Kendall's coefficient of 
concordance. The four entries in 
each row in Table 4 were obtained 
from the S during the 
experimental session. 
Note that all 46 tau 
computed for the probability prob- 
lems are significant beyond the .05 
level, and 45 of the 46 are significant 
beyond the .01 level. The average 
Spearman rank correlations are .87 
for the first set and .76 for the second 
These results indicate that Ss 
tended to select high subjective proba- 
bility alternatives for highly likely 
low subjective 


same same 


correlations 


set. 


events, and proba- 

* The eight decks presented only once were 
the 100/0, 85/15, 80/20, 70/30, 30/70, 15/85, 
10/90, and 0/100. 
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bility alternatives for events of low 
probability. 


Summary of Experimental Results 


The empirical results in both the 
discrimination and probability tasks 
support the use of the proposed 
method as a means for measuring sub- 
jective probability, confidence, and 
and suggest that the method 
may provide a means for measuring 
stimulus ambiguity. The high corre- 
lations also indicate that it is an 
acceptable procedure for use in indi- 
vidual prediction. The method pro- 
vides an estimate of subjective proba- 


bias, 


bility from a single response. 

The fact that the correlations are 
less than 1.0, however, indicates that 
Ss do not consistently choose the same 
response to the same stimulus. This 
may reflect fluctuations in utility 
during the experiment, or may result 
from obtaining subjective probability 
measurements that were more precise 
than those employed by 5S, or the 
results may be due to real changes in 
S's expectancy from one stimulus 
presentation to the next. The fluctua- 
tions may further have been due, in 
this experiment, to the alternatives 
having approximately equal SEU over 
a wide range of discrimination prob- 
lems and objective probabilities. 


CONCLUSIONS 


This paper described several meth- 
ods for obtaining measures of subjec- 
tive probability from a single response. 
One method required a priori knowl- 
edge of S’s utility scale and the other 
that S never rank a set 
Both 


required 
of alternatives inconsistently. 
methods were based on modifications 
of the SEU model and required that 
utility and subjective probability not 
interact. 

Two experiments were conducted 
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to test one of the assumptions basic 
to both methods: the assumption that 
people vary their preference for alter- 
natives according to subjective proba- 
bility. this 
assumption, indicating that if utility 
and not 
interact, estimates of subjective prob- 
ability 


The results supported 


subjective probability do 


can be obtained from a single 
response 

The 
methods must be subjected to further 
The reliability of the 
subjective probability estimates, the 
the 
alternatives presented to S, the rela- 


Several questions remain. 


empirical test. 


effects of increasing number of 
tionship between subjective and ob- 
jective probability, and the validity 
of the basic assumptions of the SEU 


model are still to be determined. 
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THEORETICAL NOTES 


A NOTE ON 


SHELDON 
The 


Tests for stimulus generalization to va- 
riation in stimulus intensity gen- 
erally followed a typical procedure. A 
of three, four, stimulus in- 
tensity values are selected. One group 
of subjects learns a response to the weak- 
est intensity, another group learns to the 
strongest, and then both groups are tested 
for generalized responding to the 
plete set of stimuli. 


have 


set or five 


com- 


The two way gradients found by such 
a procedure have shown a typical asym- 
metry despite a respectable diversity of 
stimulus 
and learning paradigms. A description 
of the asymmetry given by Hull (1949) 
notes points: (a) the 
group trained at the weak intensity value 
s concave downward; (b) The curve of 


types of subject, dimensions, 


five The curve of 


the strong-trained group is concave up- 
The strong-trained group 
shows stronger response to its training 
stimulus than does the weak-trained 
group; (d) The downward curve is 
steeper in slope than the upward curve; 


ward; (Cc) 


(e) The upward and downward curves 
intersect at their midpoints. 


Explanations of the have 
involved an energizing property of CS 
intensity (Hull, 1949), or dif- 
ferential conditioning process involving 
contextual stimuli “".ogan, 1954; Perkins, 
1953). However, the which a 
curve will appear to have depends, of 
course, upon baseline and ordinate scal- 
The purpose of the present note 


asymmetry 


else a 


shape 


ing. 


1 Study performed with the aid of a Ford 
Foundation in Behavioral Sciences research 
grant. The writer wishes to express his 
gratitude to Donald W. Fiske, Austen H. 
Riesen, and Charles C. Spiker for helpful 
critical readings, and to thank Paul 
Grim for computing the asymmetry indices 
of the two figures. 


also 


INTENSITY GENERALIZATION 
AND PROTHETI( 


SCALING? 


H. WHITE 


University of Chicago 


is to point out that: (a) newer psycho- 
physical an alternative to 
the practice, traditional in intensity gen- 
eralization studies, of selecting intensity 
values at equal logarithmic intervals; (b) 
arbitrary logarithmic scaling can produce 
generalization asymmetry resembling the 
typical picture under discussion; and (c) 
rescaling baselines of existing curves in 
terms of magnitude-estimation scales re- 
duces the apparent asymmetry of some of 
the intensity generalization data. 


scales offer 


SCALING OF PROTHETIC DIMENSIONS 


(Stevens, 1957; Stevens & 
Galanter, 1957) has recently presented 
evidence for a dichotomy among stimulus 
Those dimensions along 
which stimulus change appears to take 
place by addition or subtraction of stimu- 
lation prothetic, while 
other dimensions, for which change is to 
extent a matter of substitution, 

metathetic. Prothetic 
generally associated with 
attributes. The stimulus di- 
which asymmetry of gen- 
been found (brightness, 
loudness, and area) are classified 
prothetic, while those which have showed 
approximate symmetry (hue, pitch, locus 
on skin, locus in space, length of line) 
are either metathetic or have been treated 
as though they were. That is, 
mental sets of stimulus values have not 
been logarithmically spaced in the man- 
ner usually employed for prothetic di- 
mensions. 

The logarithmic spacing of a series of 
intensity values arranges them so that 
approximately equal numbers of jnd’s 
fall between successive values in the 
series, assuming that the stimuli so spaced 
are thereby also arranged in even steps 


Stevens 


dimensions, 


are classed as 
some 


are classed as 
dimensions are 
intensity 
mensions along 
eralization has 


as 


experi- 
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of similarity, or psychological distance. 
The plausibility of the assumption has 
led investigators of stimulus generaliza 
tion to space their stimuli in accord with 
the principle; Hovland (1937a) actually 
counted jnd’s to locate stimuli, and later 
studies have used the logarithmic spacing 
as a good approximation of this pro- 
cedure. 

Another psychophysical estimate of ap- 
parent stimulus distances produces an 
alternative scaling. The method of mag- 
nitude estimation requires the observer 
to assign numbers to stimuli in accord 
with what he judges their intensities to 
be. Differences between numerical mag- 
nitude estimations for stimulus intensities 
are typically in better agreement with the 
actual physical energy differences be- 
tween those stimuli than the jnd curve 
might lead one to expect. That is, mag- 
nitude estimations are generally related 
to physical energy values by a power 
function, of curvature than the 
logarithmic jnd function. Some dimen- 
sions even show a linear relationship be- 


less 


iT 
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Notes 


tween magnitude estimation and physical 
value (Stevens & Galanter, 1957). 

While logarithmic (jnd) scaling has 
been traditional and accepted, magnitude 
scaling has some points in its favor. The 
summing of jnd’s to estimate psychologi- 
cal distance rests on an unproven as- 
sumption, historically controversial, that 
jnd’s are equal, summable, “units” of sen- 
sation. Magnitude scaling does not ap- 
pear to require such a critical assumption. 
Further, magnitude scaling, and only this 
psychophysical technique to date, has 
showed some amount of validity by per- 
mitting quantitative cross-modality scal- 
ing predictions (Stevens, Mack, & 
Stevens, 1960). 

Errects OF LOGARITHMIC SCALING 

What is the effect of baseline scaling 
on a_ two-way generalization curve? 
Suppose that five intensity values have 
been chosen so that the distances between 
their logarithms are equal. The line at 
the top of Figure 1 illustrates the place- 
ment of the stimuli, A-E, on the physical 
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Strong- 

Trained Se Sp Se 
Log Spacing 

Fic. 1. 








S,Sp Sc So 
Se Sp 


£ 
Se SuSa 
Actual Spacing 


Linear, symmetrical generalization curves inappropriately drawn over a loga- 


rithmically scaled baseline (Graph A), and drawn over the correctly scaled baseline (Graph 
B). (The training points for the weak-trained group—open circles, dashed line, and the 
strong-trained group—filled circles, solid line, are at the left, and the curves extend right- 


ward. 


The parenthesized numbers are a rough index of asymmetry, representing the 


number of square inches between the curves as originally drawn in 3” X 3” size.) 
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dimension; the traditional assumption 
would be that they mark off approxi- 
mately equal apparent steps. 

A weak-trained group of subjects 
learns a response to S,, while a strong- 
trained group learns to Sx, following 
which both groups are tested for gen- 
eralized responding to the complete set 
stimuli. We will suppose that 
the generalization function is linearly, 
symmetrically related to physical energy 
differences between training and test 
stimulus; the subject generalizes in direct 
proportion to the physical energy dif- 
ferences between stimuli. 

Hypothetical results under the above 
conditions are given in Figure 1, in both 
the A and B graphs. In the A graph, the 
spacing of the stimuli along the abscissa 
is even, following the assumption that 
equal logarithmic distances represent 
equal apparent stimulus change. The 
generalization curves appear to be mark- 
edly asymmetrical, with that for the 
weak-trained group being concave down- 
ward and that for the strong-trained 
group being concave upward. 

In the B graph, the stimuli are spaced 


of five 


according to their physical energy dis- 


tances, and the curves for the weak- 
trained and strong-trained groups are 
symmetrical and linear. 

It is possible that the practice of 
logarithmic spacing, characteristic of 
intensity generalization studies, may have 
analogously led to an observed asym- 
metry. Some support for this argument 
will now be sought in the existing litera- 
ture. 

The analysis of the hypothetical ex- 
ample just given is to serve as a model 
for what follows. In that example, it 
was assumed that apparent stimulus dif- 
ferences were in exact accord with the 
physical energy differences between 
stimuli. For the studies to be considered, 
it will be assumed that apparent stimulus 
differences are best estimated from the 
yielded by the psychophysical 
method of magnitude estimation. 


scales 


Stupies OF INTENSITY GENERALIZATION 


The general format of a study of bi- 
directional intensity generalization has 


been described above. Some distinction 
should be made between a “pure” design 
—in which each subject is tested on only 
one stimulus for generalization—and the 
more common counterbalanced design, 
where each subject is tested on all of the 
stimuli, with different subgroups of sub- 
jects used to counterbalance the order of 
testing among stimuli. Despite the fact 
that the data from counterbalanced de- 
signs are more difficult to interpret— 
being a mixture of generalization, ex- 
tinction, retraining, and generalization 
of extinction effects—such designs have 
been used because of the considerable 
economy in number of subjects which 
they permit. 

The relevant literature 
briefly described: 

1. Hovland (1937a) used a GSR con- 
ditioning procedure, with his stimuli 
being four 1000 cycle tones separated in 
loudness by 50 jnd steps. Separate 
groups of 16 college students were 
trained at his faintest and loudest tones, 
and then tested by a counterbalanced pro- 
cedure. 

2. 


may now be 


3rown (1942) used rats in a 
straight alley approach situation. His 
stimuli were three illuminations of a 
large ground-glass screen at the end of 
the alley. There were two groups of 18 
rats each, tested by a counterbalanced 
procedure. 

3. Grice and Saltz (1950) also used 
rats in a straight alley approach situation. 
Their stimuli were white circles, 
four of them graded in area at logarith- 
mically equal intervals and the fifth at an 
intervening half-step. Testing was by a 
pure procedure, with 15 subjects for each 
test stimulus. Two sets of data are to 
be considered here, the resistance to ex- 
tinction data reported originally, and 
speed data reported later (Grice, 1956). 

4. Miller and Greene (1954) used rats 
in a straight runway avoidance paradigm. 
The stimuli were three loudnesses of a 
buzzer. There were 30 rats, tested by a 
pure procedure. Each data point of the 
extinction 
based upon five animals. 

5. Spiker (1956) tested children on a 
task in which they responded freely dur- 


five 


resistance to measure was 
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lic. 2. The A columns show the results of intensity generalization studies with ab 
scissae scaled to log stimulus energy (or, in Hovland’s case, jnd distances). The training 
points for the weak-trained groups (open circles, dashed lines), and the strong-trained 
groups (filled circles, solid lines) are at the left, and all curves extend rightward. In the 
B columns, the same data are presented over abscissae rescaled in terms of psychophysical 
magnitude estimations. (AII figures have been proportioned to occupy the same square 
area. The parenthesized numbers are a rough index of asymmetry, representing the 
number of square inches between the curves as originally drawn in 3” X 3” size.) 
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His stimuli 
illuminations of a 
ground-glass stimulus window ; 
the stimuli were at logarithmically equal 
foot-candle and the fourth at 
Thirty chil- 
dren were given 12 trials of training (15 
weak-trained and 15 strong-trained), fol- 
lowed by a counterbalanced testing pro- 
cedure; another 30 were given 24 trials 
of training before test. These two train- 
ing groups will be considered separately 
below. In addition, the data of the 
pooled groups on their first generaliza- 
tion test trial will 
relatively pure generalization measure. 

An additional study (Heyman, 1957) 
is clearly relevant in design but will not 
be further considered because the data, 
as tabled, did not permit clear appraisal 
of curve shapes. 

The A columns of Figure 2 show the 
generalization reported for the 
studies. To provide a clearer 
picture of symmetry or 
isymmetry of the curves, the training 
points of both the weak-trained and 
strong-trained groups have been placed 
on the left, and the curves extended right- 
ward. All figures have been _ propor- 
tioned to occupy the square area. 
The scaling along the abscissae of the 
Column A figures is logarithmic, except 
in the case of Hovland, where it is in 
terms of jnd units. 


ing 3-second trial intervals. 


were tour circular 


three of 


intervals 
an intervening half-step. 


be considered, as a 


curves 
abe ve 


the degree of 


same 


rhe ordinates of the Column B figures 
are identical with those of their Column 
\ counterparts; the 
Column B 


abscissae of the 
been 
rescaled in the terms of psychophysical 
magnitude The stimulus 
energy values (£) for each study were 
transformed to magnitude values 
(M) using the power function M = E*, 


figures, however, have 


estimations. 
scale 


where the exponent k is a characteristic 
of the dimension established by psycho- 
physical investigation. For Hovland’s 
loudnesses of 1000 cycle tones, k was 0.3 
(Stevens, 1956). For study, 
as for Spiker’s, brightness of illumination 
called for a k of 0.36 (Stevens & Gal- 
anter, 1957). Miller and Greene’s buzzer 
loudnesses were converted using an equa- 
1958) with 


3rown's 


tion for white noise (Stevens, 
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a k of 0.3. . Finally, Grice’s dimension of 
range of 0.9- 
15 (Stevens, 1957); ak of 1.0 was used 


isual area involved a k 


If the asymmetry of the Column A 
figures were the result of baseline scaling 
their treatment in Column B 
should make them roughly symmetrical. 
extraneous factors in the data 
being considered combine to make this a 
fairly gross specification. 


alone, 


Several 


First, the psychophysical data on which 
the rescalings from 
adult human observers, but three of the 
studies with rats and a 
fourth with children. The Column B 
baselines are therefore only educated 
a proper scale for those sub- 


were based comes 


five were done 


guesses at 
jects. 

Second, the response strengths of the 
weak-trained and strong-trained groups 
were not always equal at their respective 
training stimuli. Generalization curves 
extending from training stimuli of dif- 
ferent height would not be superimposed, 
nor is there any good reason to expect 
them to be parallel. Connected in here 
is an ordinate scaling problem—the prob- 
ability that the same amount of response 
change has different implications starting 
from different levels of responding. 

A third factor which might prevent 
rescaling from bringing about full sym- 
metry applies particularly to counter- 
balanced designs. If the line at the top 
of Figure 1 is examined, it will be seen 
that logarithmic spacing of stimuli causes 
those of weaker intensity to be placed 
relatively closer together than stimuli of 
greater intensity. During repeated test- 
ing, the net effect of this is to produce 
less generalization of inhibition at the 
training value for the strong-trained 
group than for the weak-trained group, 
while loading relatively more generalized 
tail of the 
The strong-trained group 


rf 
] 


inhibition onto the strong- 
trained group. 
consequently shows a gradient 
This 
difference in steepness is not removed by 
It might help account for dif- 


ferences between bright-trained and dim 


steeper 


than does the weak-trained group. 
rescaling. 


trained subjects at their respective train- 
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the factor discussed in the 
preceding paragraph. 

Now, examining Figure 2B, it is seen 
that the rescaling meets with partial suc- 
cess in symmetrizing the curves. Five of 
the curves—the two derived from the 
Grice data, and the three derived from 
the Spiker study—appear to be brought 
closer to symmetry. These curves 
only three completely inde- 
pendent sets of data among them. On 
the other hand, the rescaling of the Hov- 
land, Brown, and Miller and 


studies has worked no improvement. 


ing values, 


five 
represent 


Greene 


DiscussION OF STUDIES 


Despite the fact that Hull advanced his 
five rubrics from knowledge of only the 
Brown and Hovland data, the failure to 
symmetrize either these or the Miller and 
Greene data is, to some degree, tied to 
discrepancies between these studies and 
Hull’s typical picture. Hovland’s weak- 


trained group shows a_ generalization 


curve regularly ascending away from the 
training point; Brown's weak-trained and 


strong-trained curves show 
the Miller curves are exactly opposite in 
curvature to the standard. Only in the 
case of the Miller and Greene data—for 
which WN is relatively low and reported 
response variance quite high—is there a 
clear cause to doubt the reliability of the 
reported curve shapes. Indeed, the as- 
cending trend of Hovland’s weak-trained 
group was reproduced by in 
later sets of intensity GSR generalization 
data (Hovland, 1937b). 

On the other hand, the and 
Saltz, and Spiker data, which on the 
whole provide the best complete approxi- 
mation to Hull’s picture, are most altered 
by rescaling. Their residual asymmetry 
might be due to the extraneous factors 
noted above, to sampling error, or to an 
intensity effect which is only reduced in 
apparent magnitude by rescaling. 

Differentiae of studies not made sym- 
metrical by rescaling might include: (a) 
a large difference between response 
strengths of the dim-trained and bright- 
trained groups to their training stimuli; 
(b) aversive, as opposed to appetitional, 
incentives; (c) auditory, as opposed to 


no overlap; 


him two 


Grice 


an. Notes 


(d) 
opposed to pure, designs; or (¢) a stimu- 
lus set of greater “impact,” whether be- 
cause of intensity or of dominance 
figure against ground. 

What can be said, then, about the re- 
scaling venture? There is no a priori 
reason for preferring one expression or 
picturing of a set of data to another, but 
choices can be made in terms of useful- 
ness for interpretation and communica- 
tion. Here, the limited reduction of 
asymmetry by magnitude scaling may 
usefully clarify where and how much a 
special intensity factor must be invoked. 

There is still evidence for an asym- 
metry of generalization, which may re- 
flect an intensity factor. However, the 
intensity interpretation still faces as a 
difficulty the fact that a majority of the 
half-dozen conditioning studies which 
searched for an effect CS in 
tensity have been unable to find it. 


visual, stimuli; counterbalanced as 


as 


have ot 


SUMMARY 


The typical asymmetry found in studies 
of two-way intensity generalization may 
involve baseline scaling, since all such 
studies have spaced their stimuli at equal 
jnd distances, either directly or by a 
logarithmic approximation. Magnitude 
estimations of stimuli yield a different but 
justifiable power-function spacing. An 
artificial example of a typical asymmetry 
produced by logarithmic spacing is given. 
Rescaling of intensity generalization data 
in the literature to magnitude-scale base- 
lines results in increased symmetry in 
some cases. 
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ALL-OR-NONE VERSUS 


JOAN E. 


INCREMENTAL LEARNING! 


JONES 


Defence Research Medical Laboratories; Toronto, Canada 


The process of learning may be con- 
ceived of as the gradual strengthening of 
connections between events or, alter- 
natively, as the establishment of such 
connections in an all-or-none fashion. 
Estes (1960) presented results which 
suggest that evidence for the all-or- 
none position tends to be obscured by 
the traditional techniques of grouping 
data. In one of a number of miniature 
experiments cited by Estes (1960, pp. 
213-215) paired were re- 
called on successive test trials (T,, Ts) 
after a single reinforced presentation 
(R,). The sequence was R,, Tj, rest, 
Ty. Estes argued that, if an 
mental theory is tenable, the probability 
of a correct response appearing on T, 
should be independent of recall on T}. 


associates 
incre 


An all-or-none theory suggests that cor- 
rect T. should occur only 
by chance correct response 
was given on T, Figure 1). A 

relationship found between 
recall on T, and Toy, i.e., of the 49% 
of cases correct on T, 75% were cor- 
rect on T, (C-C cases) and of the 51% 
of cases incorrect on T, 91% were in- 
correct on Ty (N-N cases). 

Estes’ statement of incremental theory 
forces upon it the assumptions that a 
reinforced presentation results in equal 
increments in associative strength for all 
items in all subjects and that the strength 
of competing responses is equal in all 
cases. His deductions concerning the 


responses on 
unless a 
(see 


close was 


probability of a correct response on Ts 


these as- 
sumptions are not made. Estes contends 
that he has disposed of the point of item 
difficulty and individual differences by 
demonstrating that for his subjects and 
with his reinforcement 
Medical Laboratories 
Report No. 241-4, PCC 
H.R. No. 204. 


do not necessarily follow if 


items a second 


1 Defence Research 
Project No. 241, 
No. D77-94-01-16 


(R,) interpolated between T, and Ts. 
yields approximately the same percentage 
of newly correct responses on Ty, (46%) 
as occurred on T, following R, (40%) 
(Estes, 1960, pp. 215-216). However, 
any prediction of the relative percentages 
of newly correct responses following one 
reinforced presentation and following 
two, involves making assumptions about 
the distributions of item difficulty and 
learning ability. The character of those 
distributions for such a limited number 
of items and subjects can be known only 
empirically. Although the approximate 
equivalence of the obtained percentages 
is consistent with an all-or-none inter- 
pretation, the demonstration does not at 
all preclude the possibility that R, re- 
sults in unequal increments for different 
cases. 

According to the latter view, the asso- 
ciations have achieved unequal strengths 
after R,, hence they do not have equal 
probabilities of appearance. Subclas- 
sifying the .cases into those which do 
(C) and those which do not (N) ap- 
pear on T, biases the probabilities of 
appearance on T. without further re- 
inforcement in favor of the C cases. 
Assuming that there is no forgetting 
between T, and Ts, the C-N and N-C 
represent approximately half of 
which the are at 
about threshold strength and the num- 
ber of cases in each should be approxi- 
mately equal. (The proportions will be 
equal in the special case where the C/N 
split on T, is 50/50.) Forgetting is 
reflected in an increase in the number 
of C-N cases and a decrease in the 
of N-C That is, incre 
mental theory does not have to predict 
the frequency of N-C cases. However, 
it can predict trends in the N-C and C-N 
when the the 
lesser 


cases 


those in associations 


number cases. 


categories conditions of 


experiment produce greater or 
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degrees of forgetting. According to the 
all-or-none theory, forgetting will be 
reflected in an increase of cases in the 
C-N category but the frequency of cases 
in the N-C category will be unaffected 
unless subjects are guessing intelligently, 
in which case it will decrease. 

Another problem which precludes un- 
critical acceptance of Estes’ interpreta- 
tion is the indeterminate theoretical 
chance level. According to an all-or- 
none theory the number of N-C cases 
should be no greater than that expected 
on the basis of sheer guessing. The 
chance value depends on the population 
of responses among which the subject is 
guessing, which might be a number less 
than, equal to or greater than eight, the 
number of items presented, and cannot 
be specified other than empirically. The 
alternative forms of guessing might be 
referred to as “intelligent,” ‘“unintel- 
ligent,” and “stupid” guessing. Further 
information is necessary in order to 
whether the N-C which 
whatever their number, are 
sheer guesses or possible true recalls. 
This information is not available in the 
data as collected by Estes but an ex- 
tension of his procedure may clarify 
why N-C cases do occur and also in- 
dicate whether have been 
established by R,, though not revealed 
on T}. 


evaluate cases 


do occur, 


associations 


Three experiments are reported here. 
The first is a replication of Estes’ experi- 
ment with the trial sequence R,, 1), 
“rest,” Ts, confirming his result. The 
second is a variation and extension of 
Estes’ procedure, the trial sequence being 
R,, T;, To, Ts, Ty. The third is a repe- 
tition of Experiment IT using a double- 
blind technique. 

EXPERIMENT | 
housewives acted as 
Experiment I. 
associate-pairs, a 


‘Twenty-seven 
subjects in Each of 
eight consonant as 
stimulus and a number as response, were 
shown on separate 3” X 24%” cards. The 
responses were the numbers | through 8. 
The experimenter sat at a table op- 
posite the subject and presented the 
cards manually after shuffling. On R, 















































One 
Reinforcement Test Rest Test 


Fic. 1. Schema representing effects of 
a single reinforcement according to in- 
cremental (upper branch) vs. all-or-none 
(lower branch) theories. (Squares rep- 
resent subjects with the proportion of 
darkened area in each indicating the prob- 
ability of the correct response, C, for the 
given individual) (from Estes, 1960, p. 214.) 


the cards were each held while the sub- 
ject read aloud the consonant and the 
number. On test trials cards bearing 
only the consonant stimulus were shuffled 
and then presented. The test cards were 
held until the subject responded with the 
number she though correct or with a 
No omissions were allowed. Ts 
followed T, after approximately 7 min- 
utes during which time the subject took 
part in another experiment which in- 
volved sorting bearing lines of 
different lengths and orientations. There 
was no warning that a second test would 
be made. 
Experiment | 
cation of Estes’. 


guess. 


cards 


was not an exact repli- 
Apart from the usual 
experimenter, subject, 
the present experiment 
Estes’ in four respects: 
present experiment 
rather than three-consonant 
were used as stimuli. (0b) 
3ecause the cards were shown manually 
the presentation and between-presenta- 
tion duration were variable whereas 


differences of 
time, and place 
differed 
(a) In the 


from 
con 
sonants 


syllables 
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Estes timed both of these intervals. (c) 


No omissions were allowed whereas 
Estes was forced to accept some omis- 
sions because of the restricted intertrial 


interval. (d) The “rest” period between 


T, and T, was longer than that allowed 
by Estes and occupied with a different 


R, T, T, T; 
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Fic. 2. Results of Experiment II and 
Experiment III (in parentheses). (Empit 
ical values are proportions of instances in 
which correct, C, and incorrect, N, responses 
on a given test trial were followed by 
C and N responses on the following test 
trial.) 
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activity. From Estes’ full report of his 
experiment (Estes, Hopkins, & Crothers, 
1960) it appears that the “rest” interval 
was brief (estimated at 14% minutes) and 
was used for a reinforced presentation 
of other items from the list. The prob- 
able unimportance of these procedural 
differences can be from the 
results of Experiment I which are es- 
sentially the same as those obtained by 
Of the 53% of cases correct on 
, 75% were correct on Ty (C-C) and 
of the 47% not correct on T, 81% were 
not correct on Ty, (N-N). 


assessed 


Estes. 


EXPERIMENT II 


The subjects for Experiment II were 
the same 27 women who served in 
periment I. Both experiments were con- 
ducted in the same session, Experiment 
II first, and were separated by approxi- 
mately 7 minutes during which time the 
subject took part in the 
experiment. The procedure was as de- 
scribed for Experiment I except that 
the test cards were reshuffled and re 
presented immediately until four test 
trials were completed. The experimenter 
gave no indication beforehand that more 
than test would be The 
stimuli were consonants, none of which 
appeared in Experiment I. The re- 
again the numbers 1 


Ex- 


card-sorting 


one made. 


sponses were 
through 8. 
The probability of recall on To4 
largely depends on whether the correct 
response is given on T, (see Figure 2), 
again confirming Estes’ findings. 
ever, features of the results are difficult 
to reconcile with an all-or-none theory. 
Although the percentage of N-C cases 
(29%) is well within the range of 
chance variation, it is necessary 
plain the tendency to repeat 
guesses on Ts, and T,, i.e., N-C-C and 
N-C-C-C cases. One would expect all- 
or-none theory to require the N-C-C 
and N-C-C-C values to approximate 
.29, i.e., the guessing rate established 
on Ts. The incremental interpretation 
demands that those values approximate 
50, these cases being at threshold 
strength. The obtained values of .61 


How- 


to ex 
those 
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rABLE 1 


[TRENDS IN C-N AND 


N-C Cases AS A FUNCTION OF 


COMPARISON OF N-C VALUES WITH 
OF INTELLIGENT GUESSING (g) 


Percentage correct 


Retention 
index T:/T: 


Experiment 


T: 


Ji 


48 


Estes 


and .77 are too high for either of these 
interpretations. These cases cannot be 


fully accounted for as part of a general 
tendency to repeat earlier calls on later 


trials. Of those who change their call 
on Ts, having been wrong on T,, 61% 
of those correct repeat the second call 
and 36% while of those 
again incorrect only 37% repeat the 
second call and 14% twice. If 
the N-C cases are regarded as guesses it 
is necessary to explain why those calls 
are repeated with a much higher prob- 
ability than incorrect responses. If they 
are regarded as true recalls the expected 
N-C-C and N-C-C-C values are .50, or 
less (if forgetting occurs), but in fact 
the values increase. 

rhe implication is that there is some 
kind feedback which distinguishes 
past responses correct or incorrect. 
One possible source of such information 
under the conditions of Experiment II 
is the experimenter, who knows whether 
the response is correct or not. Experi 
III conducted to check this 
possibility. A new group of 24 subjects 
drawn from the same population learned 
the items used in Experiment II. One 
experimenter presented the paired-asso- 
ciates on R, and a second experimenter, 
unaware the cerrect 
sociations, presented the test cards on T, 
to Ty. The results of Experiment III 
appear in brackets in Figure 2. Again 
a high proportion of calls correct for 
the first time on Ty were repeated on 
T, and T4—a proportion considerably 
in excess of repetitions of incorrect calls. 
Of those correct on Ty 53% repeat the 


once twice 


once 


of 


as 


ment was 


who was of as- 


1.04 


91 


.80 


FORGETTING AND 
THE Basis 


A 


THOSE EXPECTED ON 


N-C 


.29 
19 
.09 





response once and 40% twice, while of 
those incorrect only 32% repeat the call 
once and 12% twice. 

If the feedback does not originate 
from an unwitting experimenter then it 
must be somehow induced by the re- 
sponse itself. Bearing in mind that the 
associations in question are assumed to 
be at threshold strength and also the 
fact that recognition is easier than recall, 
it seems not unreasonable to suggest that 
in some cases the subject may recognize 
that the response she has just given as 
a guess was actually correct. Such an 
interpretation renders quite conceivable 
the notion that learning can occur on 
test trials. Estes et al. (1960) in fact 
noted such a phenomenon. However, the 
interpretation suggested does require 
that an association of liminal strength be 
set up on R, and is therefore incon- 
sistent with an all-or-none interpreta- 
tion. 

Further, comparison of the results of 
Estes’ experiment with those of Ex- 
periments II and III shows an effect 
which is also handled more readily by an 
incremental theory. If the total number 
of correct responses is taken as an index 
of retention then it is apparent that 
forgetting between T, and Ty is greatest 
in Estes’ experiment and least in Experi- 
ment III (see Table 1). The trend is 
consistent with the fact that Estes’ sub- 
jects learned other associations between 
trials while the subjects in Experiment II 
sorted cards and in Experiment III there 
no interpolated activity. From 
lable 1 it is also seen that as forgetting 
increases so the proportion of cases in 


was 





160 


the C 
by both theoretical approaches, but also 
that the proportion in the N-C category 
decreases. A x” test based on the fre- 
quencies in the N-C categories showed 
the latter effect to be significant at the 
001 level of confidence. The effect can- 
not be accounted for in terms of an in- 
crease in the population of 
among which the subject is guessing. 
The expected N-C values for intelligent 
guessing were calculated as follows. If 
C is the proportion correct on T,, c the 
probability that an item will be learned 
on R,, and g the probability that the 
correct response will be guessed, then 
the equation for C (Estes et al., 1960) 


N category increases, as predicted 


responses 


1S: 


>=c+ (1—c)g [1] 
Substituting the obtained value of C 
and an arbitrary value of g (with the 
limitation that g << C) into Equation 1, 
for c. A estimate of g is 
from: 


solve new 


obtained 


1 


n(l — C) 


where n is the number of items in the 
list. The new estimate then 
substituted into Equation 1 and _ the 
procedure reiterated until a stable value 
of g is obtained. The g values obtained 
after three iterations are shown in 
rable 1. A  y? test of the obtained N-C 
values against the frequencies predicted 
for intelligent guessing yielded a p of 
.001. That is, the drop in the N-C fre- 
quencies as a function of forgetting ex- 
ceeds that expected simply as a decline 
in the chance level. 

The data of Experiment II strongly 
suggest that subclassifying the cases 
according to their history on successive 
trials than sort them in 


ot ¢g¢ 1S 


does no more 
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order of difficulty, each case being one 
item for one subject. ‘The probability of 
appearance of a correct response on ‘I’, 
is: for formerly correct on all 
three trials (C-C-C cases), .90; for 
cases formerly correct on two of three 
occasions (C-C-N, C-N-C, and N-C-C), 
50, .38, respectively; correct 
on one of three occasions (C-N-N-, N- 
C-N, and N-N-C), .20, .18, and .09 
respectively; for those never before cor- 
rect (N-N-N), .05. A similar ordering 
can be observed in the data of Experi- 
ment III. According to the all-or-none 
position the T, probabilities should be 
the same for all categories with the ex- 
ception of C-C-C. 

The writer appreciates Estes’ state- 
ment that he does not mean to offer the 
experiment he reports as crucial tests of 
incremental all-or-none_ theory 
and agrees with his thesis that theorists 
should examine closely the extent to 
which their concepts are artifacts of ex- 
perimental procedures. But insofar as 
the minature experiments are used as 
demonstrations, they are taken to create 
difficulties for incremental theory. Con- 
sidered in the context of additional in- 
formation from the experiments reported 
in this paper these same data raise dif- 
ficulties for all-or-none theory. The 
criticism directed at the miniature ex- 
periments is not that they are too simple 
but that they are incomplete. 


cases 


and .77 


versus 
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